Vaccine

Field of the Invention 

This invention relates to nucleic acid constructs, vectors comprising such constructs,
methods of preparing the vectors and constructs and their use in prophylaxis or
therapy, in particular therapeutic vaccines. The invention further relates to host cells
comprising the constructs and vectors and to polypeptides encoded by the constructs
as well as to the polypeptides per se. The invention further relates to pharmaceutical
formulations comprising the constructs and vectors and to the use of the constructs
and vectors in medicine. The invention relates in particular to DNA vaccines that are
useful in the prophylaxis and treatment of HIV infections, more particularly when
administered by particle mediated delivery.

Background to the Invention

HIV-1 is the primary cause of the acquired immune deficiency syndrome (AIDS),
which is regarded as one of the world's major health problems. Although extensive
research throughout the world has been conducted to produce a vaccine, such efforts
thus far have not been successful.

The HIV envelope glycoprotein gp120 is the viral protein that is used for attachment
to the host cell. This attachment is mediated by binding to two surface molecules of
helper T cells and macrophages, known as CD4 and one of the two chemokine
receptors CCR-5 or CXCR-4. The gp120 protein is first expressed as a larger
precursor molecule (gp160), which is then cleaved post-translationally to yield gp120
and gp41. The gp120 protein is retained on the surface of the virion by linkage to the
gp41 molecule, which is inserted into the viral membrane.

The gp120 protein is the principal target of neutralizing antibodies, but unfortunately
the most immunogenic regions of the protein (V3 loop) are also the most variable
parts of the protein. Therefore, the use of gp120 (or its precursor gp160) as a vaccine
antigen to elicit neutralizing antibodies is thought to be of limited use for a broadly
protective vaccine. The gp120 protein does also contain epitopes that are recognized
by cytotoxic T lymphocytes (CTL). These effector cells are able to eliminate
virus-infected cells, and therefore constitute a second major antiviral immune mechanism.
In contrast to the target regions of neutralizing antibodies, some CTL epitopes appear
to be relatively conserved among different HIV strains. For this reason gp120 and
gp160 may be useful antigenic components in vaccines that aim at eliciting
cell-mediated immune responses (particularly CTL).

Non-envelope proteins of HIV-1 have been described and include for example internal
structural proteins such as the products of the gag and pol genes and other
non-structural proteins such as Rev, Nef, Vif and Tat (Green et al., New England J. Med,
324, 5, 308 et seq (1991) and Bryant et al (Ed. Pizzo), Pediatr. Infect. Dis. J., 11, 5,
390 et seq (1992)).

HIV Tat and Nef proteins are early proteins, that is, they are expressed early in
infection and in the absence of structural protein.

The Nef protein is known to cause the removal of CD4, the HIV receptor, from the
cell surface, but the biological importance of this function is debated. Additionally
Nef interacts with the signal pathway of T cells and induces an active state, which in
turn may promote more efficient gene expression. Some HIV isolates have mutations
in this region which prevent them from encoding functional protein, and these isolates
are severely compromised in their replication and pathogenesis in vivo.

The Tat gene gives rise to a number of differentially spliced transcripts at different
times during infection. The first exon encodes a 72 amino acid protein which
dominates early in infection. The second exon encodes an additional 14 amino acids,
and this partially spliced form of Tat is found late in infection. Both forms are fully
functional transactivators, but the longer form also contains an RGD motif important
for binding to αvβ3 and α5β1 integrins. Tat binds to a short stem-loop structure, known
as the transactivation response element (TAR), that is located at the 5' terminus of
HIV RNAs, and up-regulates transcription from the HIV LTR at least 1000-fold. Tat
has a role in promoting the elongation phase of HIV transcription and stimulates the
production of full-length viral transcripts. Tat can also affect the expression of a
number of cellular genes: it can activate the expression of genes including
TNF, IL-2 and IL-6, and regulates expression of p53 and Bcl-2. Tat is produced in
excess and is secreted from infected cells. This extra-cellular Tat can enter other cells
and may prime cells for infection by HIV or accelerate the rate of HIV replication in
newly infected cells.

In a conference presentation (C. David Pauza, Immunization with Tat toxoid
attenuates SHIV89.6PD infection in rhesus macaques, 12th Cent Gardes meeting,
Marnes-La-Coquette, 26.10.1999), experiments were described in which rhesus
macaques were immunised with Tat toxoid alone or in combination with an envelope
glycoprotein gp160 vaccine combination (one dose recombinant vaccinia virus and
one dose recombinant protein). The results observed showed that the presence of the
envelope glycoprotein gave no advantage over experiments performed with Tat alone.

The Gag gene is translated from the full-length RNA to yield a precursor polyprotein
which is subsequently cleaved into 3-5 capsid proteins: the matrix protein, capsid
protein and nucleic acid binding protein and protease (1. Fundamental Virology,
Fields BN, Knipe DM and Howley M 1996; 2. Fields Virology vol 2 1996).

The gag gene gives rise to the 55-kilodalton (kD) Gag precursor protein, also called
p55, which is expressed from the unspliced viral mRNA. During translation, the N
terminus of p55 is myristoylated, triggering its association with the cytoplasmic aspect
of cell membranes. The membrane-associated Gag polyprotein recruits two copies of
the viral genomic RNA along with other viral and cellular proteins that trigger the
budding of the viral particle from the surface of an infected cell. After budding, p55
is cleaved by the virally encoded protease (a product of the pol gene) during the
process of viral maturation into four smaller proteins designated MA (matrix [p17]),
CA (capsid [p24]), NC (nucleocapsid [p9]), and p6 (4).



In addition to the 3 major Gag proteins, all Gag precursors contain several other
regions, which are cleaved out and remain in the virion as peptides of various sizes.
These proteins have different roles, e.g. the p2 protein has a proposed role in regulating
activity of the protease and contributes to the correct timing of proteolytic processing.

The MA polypeptide is derived from the N-terminal, myristoylated end of p55. Most
MA molecules remain attached to the inner surface of the virion lipid bilayer,
stabilizing the particle. A subset of MA is recruited inside the deeper layers of the
virion where it becomes part of the complex which escorts the viral DNA to the
nucleus. These MA molecules facilitate the nuclear transport of the viral genome
because a karyophilic signal on MA is recognized by the cellular nuclear import
machinery. This phenomenon allows HIV to infect non-dividing cells, an unusual
property for a retrovirus.

The p24 (CA) protein forms the conical core of viral particles. Cyclophilin A has
been demonstrated to interact with the p24 region of p55, leading to its incorporation
into HIV particles. The interaction between Gag and cyclophilin A is essential
because the disruption of this interaction by cyclosporin A inhibits viral replication.

The NC region of Gag is responsible for specifically recognizing the so-called
packaging signal of HIV. The packaging signal consists of four stem loop structures
located near the 5' end of the viral RNA, and is sufficient to mediate the incorporation
of a heterologous RNA into HIV-1 virions. NC binds to the packaging signal through
interactions mediated by two zinc-finger motifs. NC also facilitates reverse
transcription.

The p6 polypeptide region mediates interactions between p55 Gag and the accessory
protein Vpr, leading to the incorporation of Vpr into assembling virions. The p6
region also contains a so-called late domain which is required for the efficient release
of budding virions from an infected cell.

The Pol gene encodes two proteins containing the two activities needed by the virus in
early infection, the RT and the integrase protein needed for integration of viral DNA
into cell DNA. The primary product of Pol is cleaved by the virion protease to yield
the amino terminal RT peptide, which contains activities necessary for DNA synthesis
(RNA and DNA directed DNA polymerase, ribonuclease H), and the carboxy terminal
integrase protein. HIV RT is a heterodimer of full-length RT (p66) and a cleavage
product (p51) lacking the carboxy terminal RNase H domain.

RT is one of the most highly conserved proteins encoded by the retroviral genome.
Two major activities of RT are the DNA Pol and Ribonuclease H. The DNA Pol
activity of RT uses RNA and DNA as templates interchangeably and, like all DNA
polymerases known, is unable to initiate DNA synthesis de novo, but requires a
pre-existing molecule to serve as a primer (RNA).

The RNase H activity inherent in all RT proteins plays the essential role early in
replication of removing the RNA genome as DNA synthesis proceeds. It selectively
degrades the RNA from all RNA-DNA hybrid molecules. Structurally the
polymerase and RNase H occupy separate, non-overlapping domains with the Pol
covering the amino two thirds of the Pol.

The p66 catalytic subunit is folded into 5 distinct subdomains. The amino terminal 23 
of these have the portion with RT activity. Carboxy terminal to these is the Rnase H 
Domain. 

After infection of the host cell, the retroviral RNA genome is copied into linear ds
DNA by the reverse transcriptase that is present in the infecting particle. The
integrase (reviewed in Skalka AM '99 Adv in Virus Res 52 271-273) recognises the
ends of the viral DNA, trims them and accompanies the viral DNA to a host
chromosomal site to catalyse integration. Many sites in the host DNA can be targets
for integration. Although the integrase is sufficient to catalyse integration in vitro, it
is not the only protein associated with the viral DNA in vivo: the large protein-viral
DNA complex isolated from the infected cells has been denoted the pre-integration
complex. This facilitates the acquisition of the host cell genes by progeny viral
genomes.

The integrase is made up of 3 distinct domains, the N terminal domain, the catalytic
core and the C terminal domain. The catalytic core domain contains all of the
requirements for the chemistry of polynucleotidyl transfer.

DNA vaccines usually consist of a bacterial plasmid vector into which is inserted a
strong promoter, the gene of interest which encodes an antigenic peptide and a
polyadenylation/transcriptional termination sequence. The gene of interest may
encode a full protein or simply an antigenic peptide sequence relating to the pathogen,
tumour or other agent which it is intended to protect against. The plasmid can be
grown in bacteria, such as for example E. coli, and then isolated and prepared in an
appropriate medium, depending upon the intended route of administration, before
being administered to the host. Following administration the plasmid is taken up by
cells of the host, or delivered directly into the host cells, where the encoded peptide is
produced. The plasmid vector will preferably be made without an origin of replication
functional in eukaryotic cells, in order to prevent plasmid replication in the
mammalian host and integration within chromosomal DNA of the animal concerned.

There are a number of advantages of DNA vaccination relative to traditional
vaccination techniques. First, it is predicted that because the proteins that are encoded
by the DNA sequence are synthesised in the host, the structure or conformation of the
protein will be similar to the native protein associated with the disease state. It is also
likely that DNA vaccination will offer protection against different strains of a virus,
by generating a cytotoxic T lymphocyte response that recognises epitopes from
conserved proteins. The technology also offers the possibility of combining diverse
immunogens into a single preparation to facilitate simultaneous immunisation in
relation to a number of disease states.

Helpful background information in relation to DNA vaccination is provided in
Donnelly et al "DNA vaccines" Ann. Rev Immunol. 1997 15: 617-648, the disclosure
of which is included herein in its entirety by way of reference.



Doe et al (1994) Eur J Immunol, 24: 2369-2376 investigated how variations in
glycosylation affected the CD8+ CTL response to gp120 and found that gp120
produced in mammalian CHO cells had a reduced ability to prime CTL responses
when compared with insect or yeast cell-derived envelope proteins unless N-linked
oligosaccharides were removed prior to immunization.

It has now been discovered that there are benefits to be gained by employing a
polynucleotide encoding a non-glycosylated HIV envelope protein in a vaccine for
HIV. Surprisingly, a DNA vector expressing gp120 without a secretion signal and
which is thus not glycosylated or secreted from the cell is a more effective stimulator
of CTL responses than a DNA vector expressing gp120 with its native secretion
signal. Since the secretion signal is responsible for directing the gp120 to the
intracellular site where glycosylation takes place, gp120 which lacks its native
secretion signal is not glycosylated. Moreover, with the presence of a non-structural
HIV protein such as Tat in a fusion protein with the non-glycosylated gp120, CTL
responses to the gp120 are augmented. In contrast, Tat in a fusion protein with
normal gp120 prevents secretion but does not result in an augmented immune
response. The non-glycosylated gp120 can also be successfully expressed in a fusion
protein with other HIV antigens, both structural and non-structural.

Summary of the Invention 

The present invention therefore provides novel constructs for use in nucleic acid or
polypeptide vaccines for the prophylaxis and treatment of HIV infections and AIDS.
In one aspect the invention provides a polynucleotide which comprises a sequence
encoding an HIV envelope protein or fragment or immunogenic derivative thereof
which is non- or substantially non-glycosylated when expressed in a mammalian
target cell, operably linked to a heterologous promoter.

Preferably the HIV envelope protein is gp120. Alternatively it may be other forms of
the envelope protein such as gp160 or gp140.

HIV envelope proteins such as gp120 expressed in a mammalian cell will normally be
glycosylated. According to the present invention the gp120 encoding sequence is
adapted to reduce or prevent glycosylation in a mammalian target cell, particularly a
human target cell. Glycosylation may be reduced or prevented in a number of
different ways, for example by removal of or mutation of the glycosylation sites or by
removing the native secretion signal. Preferably in the polynucleotide construct
according to the invention the gp120 or other form of HIV envelope protein lacks a
functional secretion signal. The secretion signal may vary in length between HIV
isolates, for example it is 30 amino acids long in the W61D isolate described herein,
but may be more or less than that for different isolates. Generally the secretion signal
is clearly delineated and will be removed in its entirety, although this is not
necessarily the case. A sufficient amount of the signal will be removed to prevent its
function of taking the envelope protein to the cellular machinery responsible for
glycosylation. This can be easily tested for.
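
By way of illustration only, removal of the secretion signal at the sequence level
can be sketched as below. This is a minimal sketch under stated assumptions: the
function name, the re-addition of a start codon and the placeholder input are
hypothetical and are not taken from the constructs described herein; only the
30 amino acid signal length stated for W61D comes from the text.

```python
def remove_signal_sequence(cds: str, signal_len_aa: int = 30) -> str:
    """Return a coding sequence lacking its N-terminal secretion signal.

    cds           -- full-length coding sequence, starting with ATG
    signal_len_aa -- number of N-terminal codons to delete (30 is the length
                     stated for W61D; other isolates may differ)
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    if not cds.startswith("ATG"):
        raise ValueError("coding sequence must start with ATG")
    if len(cds) <= 3 * signal_len_aa:
        raise ValueError("signal length covers the whole sequence")
    mature = cds[3 * signal_len_aa:]
    # Assumption of this sketch: a start codon is re-added so that the
    # truncated open reading frame can still be translated.
    return mature if mature.startswith("ATG") else "ATG" + mature


# Placeholder input, NOT a real gp120 sequence:
# ATG + 29 filler codons (the "signal") followed by three "mature" codons.
toy_cds = "ATG" + "CTG" * 29 + "GCCGAGAAG"
print(remove_signal_sequence(toy_cds))  # ATGGCCGAGAAG
```

For an isolate with a different signal length, `signal_len_aa` would be changed
accordingly; whether the resulting protein is in fact non-glycosylated would still
need to be confirmed experimentally.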

It will be understood that although the envelope protein does not have a functional
secretion signal, there may still be a small amount of glycosylation taking place and
this is not excluded from the invention. Thus the envelope protein according to the
invention is substantially non-glycosylated in mammalian cells such that the majority
of the envelope protein, for example greater than 50%, or preferably greater than 75%,
or more preferably greater than 90%, or most preferably 95% or more of the envelope
protein is not secreted from the cells.

The HIV envelope protein, particularly gp120, is preferably expressed as a fusion
protein, preferably comprising at least one HIV protein selected from a non-structural
protein such as Nef, Tat, and reverse transcriptase (RT) and a structural protein, in
particular a capsid protein such as Gag, or a fragment or immunogenic derivative of
any of these.

In one embodiment the fusion protein is a gp120 and RT-containing fusion protein,
optionally also comprising Gag and/or Nef.

In another embodiment the fusion protein is a gp120 and Gag-containing fusion
protein optionally also comprising RT and/or Nef.

In a further embodiment the fusion protein is a gp120 and Nef-containing fusion
protein optionally also comprising RT and/or Gag.



In the following preferred embodiments the fusion protein is a fusion of gp120, RT,
Nef and Gag or fragments or immunogenic derivatives thereof:

gp120-RT-Nef-Gag
RT-Nef-Gag-gp120

Another embodiment is a fusion comprising gp120 and Tat or fragments or
immunogenic derivatives thereof. In such an embodiment the polynucleotide
according to the invention comprises a gp120 encoding sequence linked to a Tat
encoding sequence to encode a gp120 and Tat-containing fusion protein.

In a particular embodiment the gp120 and Tat sequence is further linked to a Nef
encoding sequence to encode a gp120, Tat and Nef-containing fusion protein, most
preferably a gp120-Nef-Tat fusion.

Additional HIV sequences may be included such as a Gag encoding sequence. 

In another particular embodiment the fusion encoded by the polynucleotide according
to the invention is a gp120-Gag-Nef-Tat fusion.

Preferably the Tat sequence for use in the invention is mutated so that it encodes a
biologically inactive Tat which lacks transactivation activity but which maintains its
immunogenic epitopes. Tat transactivation activity can be measured for example by
an HIV LTR reporter system such as a CAT assay system in which the
chloramphenicol-acetyl transferase reporter gene is under the control of the long
terminal repeat of HIV-1. One specific CAT assay which is suitable uses the HL3T1
cell line which is described in Felber and Pavlakis (1988) Science 239: 184-187 and
also used in Mischiati et al (2001) Antisense Nucleic Acid Drug Dev Aug 11(4):
209-17. HL3T1 is a HeLa cell line which contains stably integrated silent copies of HIV-1
LTR promoter linked to the CAT gene. These cells are transfected with DNA vectors
containing the Tat gene, modified or not. CAT is produced upon presence of an active
Tat and measured by ELISA.



One preferred mutated Tat sequence (originating from BH10 molecular clone) bears
mutations in the active site region (Lys41→Ala) and in the RGD motif (Arg78→Lys
and Asp80→Glu) (Virology 235: 48-64, 1997).
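
As a hedged illustration of how such point mutations can be specified and checked
at the protein-sequence level, the sketch below applies substitutions written in the
usual one-letter form (K41A, R78K, D80E). The helper name and the placeholder
sequence are hypothetical; the placeholder is not the BH10 Tat sequence and merely
carries the relevant wild-type residues at positions 41, 78, 79 and 80.

```python
import re


def apply_substitutions(protein: str, substitutions: list[str]) -> str:
    """Apply substitutions such as 'K41A' (1-based positions) to a protein.

    Each substitution is checked against the residue actually present, so a
    numbering mistake raises an error instead of silently mutating the
    wrong position.
    """
    residues = list(protein)
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"cannot parse substitution {sub!r}")
        old, pos, new = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= pos <= len(residues):
            raise ValueError(f"position {pos} is outside the sequence")
        if residues[pos - 1] != old:
            raise ValueError(
                f"expected {old} at position {pos}, found {residues[pos - 1]}"
            )
        residues[pos - 1] = new
    return "".join(residues)


# Placeholder 86-residue sequence (glycine filler), NOT real Tat.
toy = ["G"] * 86
toy[40], toy[77], toy[78], toy[79] = "K", "R", "G", "D"  # K41, R78, G79, D80
toy_tat = "".join(toy)

mutant = apply_substitutions(toy_tat, ["K41A", "R78K", "D80E"])
print(mutant[40], mutant[77:80])  # A KGE
```

The same helper could in principle describe other substitutions mentioned herein,
provided the numbering of the input sequence matches the numbering used in the text.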

Optionally the Nef sequence for use in the invention is truncated to remove the
sequence encoding the N terminal region i.e. removal of 30-85, preferably 60-85,
preferably the N terminal 65 amino acids (the latter truncation is referred to herein as
trNef). Advantageously the Nef may be modified to remove one or more
myristylation sites. For example the Gly 2 myristylation site may be removed by
deletion or substitution. Alternatively or additionally the Nef may be modified to alter
the dileucine motif of Leu 174 and Leu 175 by deletion or substitution of one or both
leucines. The importance of the dileucine motif in CD4 downregulation is described
e.g. in Bresnahan P.A. et al (1998) Current Biology, 8(22): 1235-8.
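
The Nef modifications just described (N-terminal truncation, removal of Gly 2, and
alteration of Leu 174 and/or Leu 175) can be sketched at the protein level as
follows. This is an illustrative sketch only: the function name, the choice of
alanine as the substituting residue in the example calls, and the placeholder
sequence and its length are assumptions, and no start codon handling is attempted.

```python
def modify_nef(
    nef: str,
    truncate_n: int = 0,
    delete_g2: bool = False,
    l174a: bool = False,
    l175a: bool = False,
) -> str:
    """Return a modified Nef protein sequence.

    All positions refer to the numbering of the unmodified input, so the
    requested modifications can be combined without the numbering shifting.
    """
    out = []
    for pos, aa in enumerate(nef, start=1):
        if pos <= truncate_n:
            continue  # N-terminal truncation (65 residues for trNef)
        if delete_g2 and pos == 2:
            if aa != "G":
                raise ValueError("expected Gly at position 2")
            continue  # remove the Gly 2 myristylation site
        if (l174a and pos == 174) or (l175a and pos == 175):
            if aa != "L":
                raise ValueError(f"expected Leu at position {pos}")
            aa = "A"  # substitute a leucine of the dileucine motif
        out.append(aa)
    return "".join(out)


# Placeholder sequence (serine filler, arbitrary length), NOT real Nef.
toy = ["S"] * 206
toy[0], toy[1], toy[173], toy[174] = "M", "G", "L", "L"
toy_nef = "".join(toy)

tr_nef = modify_nef(toy_nef, truncate_n=65)
ll_nef = modify_nef(toy_nef, l174a=True, l175a=True)
print(len(tr_nef), ll_nef[173:175])  # 141 AA
```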

Preferably the Gag for use in the invention does not encode the Gag p6 polypeptide.
Preferred Gag sequences for use in the invention comprise p17 and/or p24.

The RT polynucleotide for use in the invention preferably encodes a mutation to
substantially inactivate any reverse transcription activity. A preferred inactive mutant
involves the substitution of tryptophan (W) 229 by lysine (K). See WO 03/025003.

Preferably one or more of the HIV sequences included in the polynucleotide according
to the invention encoding e.g. gp120, Nef, Tat, Gag or RT is or are codon optimised
for mammalian cells, most preferably such that it/they resemble a highly expressed
human gene in their codon use.
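
The idea of codon optimisation can be illustrated with a deliberately simplistic
sketch: each amino acid is back-translated to a single codon that is commonly
reported as frequent in human genes. This is not the optimisation method used for
the constructs described herein; the table below is an illustrative assumption, and
practical methods typically also take account of codon frequency distributions,
restriction sites and RNA structure.

```python
# Illustrative table: one frequently used human codon per amino acid.
# The exact choices are an assumption of this sketch.
PREFERRED_CODON = {
    "A": "GCC", "R": "AGA", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
    "*": "TGA",  # stop codon
}


def back_translate(protein: str) -> str:
    """Back-translate a protein into DNA using one preferred codon per residue."""
    codons = []
    for aa in protein.upper():
        if aa not in PREFERRED_CODON:
            raise ValueError(f"unknown residue {aa!r}")
        codons.append(PREFERRED_CODON[aa])
    return "".join(codons)


print(back_translate("MKW"))  # ATGAAGTGG
```

Because every residue maps to exactly one codon, the encoded protein is unchanged
while the nucleotide sequence may differ substantially from the viral original.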

The fusion may contain further HIV sequences. It will be understood that for all of
the HIV sequences included in the invention, these do not necessarily represent
sequences encoding the full length or native proteins. Immunogenic derivatives such
as truncated or otherwise altered e.g. mutated proteins are also contemplated, as are
fragments which encode at least one HIV epitope, preferably a CTL epitope, typically
a peptide of at least 8 amino acids. Immunogenic derivatives of the fragments are also
contemplated. Immunogenic derivatives cross react with the native antigen or
fragment thereof. Polynucleotides which encode a fragment of at least 8, for example
8-10 amino acids or up to 20, 50, 60, 70, 100, 150 or 200 amino acids in length are
considered to fall within the scope of the invention as long as the encoded oligo or
polypeptide demonstrates HIV antigenicity. The HIV polypeptide molecules encoded
by the polynucleotide sequences according to the invention preferably represent a
fragment of at least 50% of the length of the native protein, which fragment may
contain mutations but which retains at least one HIV epitope and demonstrates HIV
antigenicity. Similarly, immunogenic derivatives according to the invention must
demonstrate HIV antigenicity. Preferred immunogenic derivatives provide some
potential advantage over the native protein such as reduction or removal of a function
of the native protein which is undesirable in a vaccine antigen such as an enzyme
activity (RT), transactivating activity (Tat), or CD4 downregulation (Nef). The
polynucleotide sequences are preferably codon optimised for mammalian cells, in line
with preferred aspects of the invention.

Preferred polynucleotide sequences are selected from the group:



13. gp120 codon optimised, minus secretion signal - p17/24 Gag - mL1Nef-mTat

14. gp120 codon optimised, minus secretion signal - p17/24 Gag - mL2Nef-mTat

15. gp120 codon optimised, minus secretion signal - mRT - trNef - p17/24 Gag

16. mRT - trNef - p17/24 Gag - gp120 codon optimised, minus secretion signal



mNef = deletion of G2 to give non-myristoylated Nef
L1Nef = L174A mutation in Nef
L2Nef = L175A mutation in Nef
LLNef = L174A and L175A mutations in Nef
trNef = Nef devoid of nucleotides encoding terminal amino acids 1-65
mRT = Reverse Transcriptase mutated to remove biological activity (W229K). RT is
codon optimised
Gag = Gag codon optimised.



The invention preferably relates to HIV-1. It is preferred that the constructs described
herein are derived from an HIV clade B or clade C, particularly clade B.



Preferably the promoter is the promoter from HCMV IE gene, more particularly
wherein the 5' untranslated region of the HCMV IE gene comprising exon 1 is
included as described in WO 02/36792.



In another aspect the invention provides a vector comprising the polynucleotide
sequences described herein. The polynucleotide sequence is preferably DNA and is
preferably contained within a vector which is a double stranded DNA plasmid.
Alternative vectors are described hereinbelow and include in particular adenovirus
vectors such as chimp derived adenovirus vectors Pan 9 or Pan 5, 6 and 7, preferably
where these are replication defective such that they cannot replicate in the target cells.

Another aspect of the invention relates to a set of polynucleotides encoding an HIV
envelope protein or fragment or immunogenic derivative thereof, particularly gp120,
and separately one or more other HIV proteins such as RT, Gag, Nef, Tat or fragments
or immunogenic derivatives thereof.

Thus the invention provides a set of polynucleotides comprising a polynucleotide as
described herein for gp120 or another form of the HIV envelope protein, and at least
one further polynucleotide encoding at least one of HIV Nef, Gag, RT or Tat or
fragment or immunogenic derivative thereof, optionally linked to an additional
promoter.

Preferably the set of polynucleotides is contained on a single vector, in particular a 
DNA plasmid, although other vectors such as adenovirus as described herein are also 
contemplated. Preferably the polynucleotides are under the control of two or more 
separate promoters. Alternatively the two polynucleotides may be under the control of 
a single promoter, and optionally translation of the second polynucleotide may be 
enhanced by inclusion of an internal ribosome entry site between the polynucleotides. 

Preferably the set of polynucleotides comprises a gp120 polynucleotide as described
herein and a further polynucleotide encoding a fusion of RT-Nef-Gag or of fragments
or immunogenic derivatives thereof.

Preferred sets of polynucleotides are selected from the group: 

1. gp120 codon optimised, minus secretion signal + trNef - mTat

2. gp120 codon optimised, minus secretion signal + p17/24 Gag - trNef

3. gp120 codon optimised, minus secretion signal + p17/24 Gag - Nef - mTat

4. mRT - trNef - p17/24 Gag + gp120 codon optimised, minus secretion signal

5. gp120 codon optimised, minus secretion signal + mRT - trNef - p17/24 Gag

Where these preferred sets of two polynucleotides are contained on a single vector 
under the control of two separate promoters, this may be referred to as a dual promoter 
vector. 

The preferred promoter for one or both of the proteins or fusion proteins in a set of
polynucleotides according to the invention is the promoter from HCMV IE gene, more
particularly wherein the 5' untranslated leader sequence comprises exon 1 as
described in WO 02/36792 and herein.

In yet another aspect the invention provides polypeptides encoded by the
polynucleotides and vectors described herein.

In one embodiment the invention provides a fusion protein comprising substantially
non-glycosylated HIV envelope protein or a fragment or immunogenic derivative
thereof and at least one additional HIV protein or fragment or immunogenic derivative
thereof, said additional HIV protein selected from Nef, Gag, RT and Tat.

In another embodiment the invention provides a composition comprising a
substantially non-glycosylated HIV envelope protein or a fragment or immunogenic
derivative thereof and at least one additional HIV protein or fragment or immunogenic
derivative thereof, said at least one additional HIV protein selected from Nef, Gag, RT
and Tat. Preferably the at least one additional HIV protein is a fusion protein of two
or more of Nef, Gag, RT and Tat or fragments or immunogenic derivatives thereof.

In yet another embodiment the invention provides a substantially non-glycosylated 
HIV envelope protein or a fragment or immunogenic derivative thereof, expressed 
from a polynucleotide which is codon optimised for mammalian cells. 

In a further aspect the invention provides pharmaceutical compositions comprising the
nucleotide sequences and vectors and polypeptides described herein, together with a
pharmaceutically acceptable excipient, diluent, carrier or adjuvant. In a preferred
embodiment the polynucleotide, preferably in the form of a DNA vector and
preferably comprising at least one codon optimised HIV sequence, is present in a
composition comprising a plurality of particles, preferably beads such as gold beads,
onto which the DNA is coated.

Delivery of polynucleotides according to the invention is preferably carried out by
particle mediated delivery, particularly via a bombardment approach.

It is envisaged that the vectors according to the invention may be utilised with
immunostimulatory agents, preferably but not necessarily administered at the same
time as the vectors and preferably formulated together in the compositions according
to the invention.



Immunostimulatory agents for use in the invention include the following, although
this list is by no means exhaustive and does not preclude other agents: synthetic
imidazoquinolines such as imiquimod [S-26308, R-837] (Harrison, et al. 'Reduction of
recurrent HSV disease using imiquimod alone or combined with a glycoprotein
vaccine', Vaccine 19: 1820-1826, (2001)) and resiquimod [S-28463, R-848] (Vasilakos,
et al. 'Adjuvant activities of immune response modifier R-848: Comparison with CpG
ODN', Cellular Immunology 204: 64-74 (2000)); Schiff bases of carbonyls and amines
that are constitutively expressed on antigen presenting cell and T-cell surfaces, such as
tucaresol (Rhodes, J. et al. 'Therapeutic potentiation of the immune system by
costimulatory Schiff-base-forming drugs', Nature 377: 71-75 (1995)); cytokine,
chemokine and co-stimulatory molecules as either protein or peptide or DNA, which
would include pro-inflammatory cytokines such as GM-CSF, IL-1 alpha, IL-1 beta,
TGF-alpha and TGF-beta, Th1 inducers such as interferon gamma, IL-2, IL-12, IL-15
and IL-18, Th2 inducers such as IL-4, IL-5, IL-6, IL-10 and IL-13 and other
chemokine and co-stimulatory genes such as MCP-1, MIP-1 alpha, MIP-1 beta,
RANTES, TCA-3, CD80, CD86 and CD40L; other immunostimulatory targeting
ligands such as CTLA-4 and L-selectin; apoptosis stimulating proteins and peptides
such as Fas (49); synthetic lipid based adjuvants, such as vaxfectin (Reyes et al,
'Vaxfectin enhances antigen specific antibody titres and maintains Th1 type immune
responses to plasmid DNA immunization', Vaccine 19: 3778-3786), squalene,
alpha-tocopherol, polysorbate 80, DOPC and cholesterol; endotoxin [LPS] (Beutler, B.,
'Endotoxin, Toll-like receptor 4, and the afferent limb of innate immunity', Current
Opinion in Microbiology 3: 23-30 (2000)); CpG oligo- and di-nucleotides (Sato, Y. et
al., 'Immunostimulatory DNA sequences necessary for effective intradermal gene
immunization', Science 273 (5273): 352-354 (1996); Hemmi, H. et al., 'A Toll-like
receptor recognizes bacterial DNA', Nature 408: 740-745, (2000)); and other potential
ligands that trigger Toll receptors to produce appropriate Th1-inducing cytokines,
such as synthetic Mycobacterial lipoproteins, Mycobacterial protein p19,
peptidoglycan, teichoic acid and lipid A.

A preferred immunostimulatory agent for use with the invention is GM-CSF. This
may be employed in the form of a polynucleotide expressing GM-CSF which is
co-administered with the DNA vaccine of the invention. A DNA plasmid encoding
GM-CSF may be present in a pharmaceutical composition comprising the
polynucleotide(s) according to the invention.

Certain preferred adjuvants for eliciting a predominantly Th1-type response include,
for example, a Lipid A derivative such as monophosphoryl lipid A, or preferably
3-de-O-acylated monophosphoryl lipid A. MPL® adjuvants are available from Corixa
Corporation (Seattle, WA; see, for example, US Patent Nos. 4,436,727; 4,877,611;
4,866,034 and 4,912,094). CpG-containing oligonucleotides (in which the CpG
dinucleotide is unmethylated) also induce a predominantly Th1 response. Such
oligonucleotides are well known and are described, for example, in WO 96/02555,
WO 99/33488 and U.S. Patent Nos. 6,008,200 and 5,856,462. Immunostimulatory
DNA sequences are also described, for example, by Sato et al., Science 273:352,
1996. Another preferred adjuvant comprises a saponin, such as Quil A, or derivatives
thereof, including QS21 and QS7 (Aquila Biopharmaceuticals Inc., Framingham,
MA); Escin; Digitonin; or Gypsophila or Chenopodium quinoa saponins.

According to a further aspect of the invention, a host cell comprising a polynucleotide
sequence according to the invention, or an expression vector according to the
invention, is provided. The host cell may be for example bacterial e.g. E. coli,
mammalian e.g. human, or may be an insect cell. Mammalian cells comprising a
vector according to the present invention may be cultured cells transfected in vitro or
may be cells transfected in vivo by administration of the vector to the mammal.

By codon optimisation is meant that the DNA sequence is optimised to resemble the 
codon usage of genes in mammalian cells. In particular, the codon usage in the 
sequence is optimised to resemble that of highly expressed human genes. 
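
By way of illustration only, and not as a description of the method actually used to
prepare the sequences of the invention, the following Python sketch back-translates a
short peptide using, for each amino acid, the codon that is most frequent in highly
expressed human genes. The function name back_translate, the dictionary
PREFERRED_CODON and the example peptide are hypothetical; only six amino acids are
included, with fractions read from the codon usage table given in this specification.

```python
# Most frequent codon per amino acid, read from the highly expressed human
# codon usage table in this specification (only a few residues are shown).
PREFERRED_CODON = {
    "M": "ATG",  # Met, fraction 1.00
    "G": "GGC",  # Gly, fraction 0.50
    "E": "GAG",  # Glu, fraction 0.75
    "K": "AAG",  # Lys, fraction 0.82
    "L": "CTG",  # Leu, fraction 0.58
    "V": "GTG",  # Val, fraction 0.64
}


def back_translate(peptide: str) -> str:
    """Return a DNA sequence encoding `peptide` using the preferred codons."""
    return "".join(PREFERRED_CODON[aa] for aa in peptide)


print(back_translate("MGEKLV"))  # ATGGGCGAGAAGCTGGTG
```

Always choosing the single most frequent codon is a simplification made for brevity;
in practice a proportion of less frequent codons may be retained.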

The DNA code has 4 letters (A, T, C and G) and uses these to spell three letter
"codons" which represent the amino acids of the proteins encoded in an organism's
genes. The linear sequence of codons along the DNA molecule is translated into the
linear sequence of amino acids in the protein(s) encoded by those genes. The code is
highly degenerate, with 61 codons coding for the 20 natural amino acids and 3 codons
representing "stop" signals. Thus, most amino acids are coded for by more than one
codon; in fact several are coded for by four or more different codons.
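
The degeneracy figures quoted above can be checked by enumerating the standard genetic
code. The following Python sketch is illustrative only; the names CODON_TABLE,
degeneracy, stop_codons and sense_codons are chosen here for convenience and do not
appear elsewhere in this specification.

```python
from collections import Counter
from itertools import product

BASES = "TCAG"
# Standard genetic code listed in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 triplets to its one-letter amino acid (or '*').
CODON_TABLE = {
    "".join(triplet): aa
    for triplet, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Number of codons assigned to each amino acid (and to '*').
degeneracy = Counter(CODON_TABLE.values())
stop_codons = sorted(c for c, aa in CODON_TABLE.items() if aa == "*")
sense_codons = len(CODON_TABLE) - len(stop_codons)

print(sense_codons)   # 61
print(stop_codons)    # ['TAA', 'TAG', 'TGA']
print(sorted(aa for aa, n in degeneracy.items() if aa != "*" and n >= 4))
```

The last line lists the amino acids that are coded for by four or more codons.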

Where more than one codon is available to code for a given amino acid, it has been
observed that the codon usage patterns of organisms are highly non-random. Different
species show a different bias in their codon selection and, furthermore, utilisation of
codons may be markedly different in a single species between genes which are
expressed at high and low levels. This bias is different in viruses, plants, bacteria and
mammalian cells, and some species show a stronger bias away from a random codon
selection than others. For example, humans and other mammals are less strongly
biased than certain bacteria or viruses. For these reasons, there is a significant
probability that a mammalian gene expressed in E. coli or a foreign or recombinant
gene expressed in mammalian cells will have an inappropriate distribution of codons
for efficient expression. It is believed that the presence in a heterologous DNA
sequence of clusters of codons or an abundance of codons which are rarely observed
in the host in which expression is to occur, is predictive of low heterologous
expression levels in that host.

In an embodiment of the present invention there is provided a gp120 polynucleotide
sequence which encodes a substantially non-glycosylated gp120 amino acid sequence,
wherein the codon usage pattern of the polynucleotide sequence resembles that of
highly expressed mammalian genes. Preferably the polynucleotide sequence is a DNA
sequence. Desirably the codon usage pattern of the polynucleotide sequence is
typical of highly expressed human genes.

In the polynucleotides of the present invention, the codon usage pattern is altered from
that typical of human immunodeficiency viruses to more closely represent the codon
bias of the target organism, e.g. a mammal, especially a human. The "codon usage
coefficient" is a measure of how closely the codon pattern of a given polynucleotide
sequence resembles that of a target species. Codon frequencies can be derived from
literature sources for the highly expressed genes of many species (see e.g. Nakamura
et al., Nucleic Acids Research 1996, 24:214-215). The codon frequencies for each of
the 61 codons (expressed as the number of occurrences per 1000 codons of
the selected class of genes) are normalised for each of the twenty natural amino acids,
so that the value for the most frequently used codon for each amino acid is set to 1 and
the frequencies for the less common codons are scaled to lie between zero and 1. Thus
each of the 61 codons is assigned a value of 1 or lower for the highly expressed genes
of the target species. In order to calculate a codon usage coefficient for a specific
polynucleotide, relative to the highly expressed genes of that species, the scaled value
for each codon of the specific polynucleotide is noted and the geometric mean of all
these values is taken (by dividing the sum of the natural logs of these values by the
total number of codons and taking the anti-log). The coefficient will have a value
between zero and 1, and the higher the coefficient the more codons in the
polynucleotide are frequently used codons. If a polynucleotide sequence has a codon
usage coefficient of 1, all of the codons are "most frequent" codons for highly
expressed genes of the target species.
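
The calculation described above may be sketched in Python as follows. This is a
minimal illustration under stated assumptions rather than a definitive
implementation: the names PER_THOUSAND, scaled_values and codon_usage_coefficient and
the two 12-nucleotide test sequences are hypothetical, and only four amino acids are
tabulated (with per-1000 frequencies copied from the codon usage table given in this
specification), whereas a complete calculation would require all 61 codons.

```python
import math

# Per-1000 frequencies for a few codons, copied from the highly expressed
# human table in this specification; a full implementation needs all 61.
PER_THOUSAND = {
    "Gly": {"GGG": 18.76, "GGA": 10.88, "GGT": 9.14, "GGC": 38.70},
    "Glu": {"GAG": 50.16, "GAA": 16.42},
    "Lys": {"AAG": 43.88, "AAA": 9.76},
    "Met": {"ATG": 22.32},
}


def scaled_values(per_thousand):
    """Scale each codon so the most frequent synonym for its amino acid is 1."""
    scaled = {}
    for codons in per_thousand.values():
        top = max(codons.values())
        for codon, freq in codons.items():
            scaled[codon] = freq / top
    return scaled


def codon_usage_coefficient(dna, scaled):
    """Geometric mean of the scaled values of the codons in `dna`.

    Assumes `dna` is in frame, has a length divisible by three and contains
    only codons present in `scaled` (no stop codons).
    """
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    log_sum = sum(math.log(scaled[c]) for c in codons)
    return math.exp(log_sum / len(codons))


SCALED = scaled_values(PER_THOUSAND)
# Met-Gly-Glu-Lys written with the most frequent codons only.
print(codon_usage_coefficient("ATGGGCGAGAAG", SCALED))  # 1.0
# The same peptide written with less frequent codons gives a lower value.
print(round(codon_usage_coefficient("ATGGGTGAAAAA", SCALED), 3))
```

Both sequences encode the same four amino acids; only the choice of synonymous
codons, and hence the coefficient, differs.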

According to the present invention, the codon usage pattern of the polynucleotide will 
preferably exclude rare codons. Rare codons can be defined as codons representing 
<20% or more preferably representing <10% of the codons used for a particular amino 
acid in highly expressed genes of the target organism. Alternatively rare codons may 
be defined as codons with a relative synonymous codon usage (RSCU) value of <0.3 
or more preferably <0.2 in highly expressed genes of the target organism. An RSCU 
value is the observed number of codons divided by the number expected if all codons 
for that amino acid were used equally frequently. An appropriate definition of a rare 
codon would be apparent to a person skilled in the art. 
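
The two alternative definitions of a rare codon given above can be compared with a
short Python sketch. It is offered as an illustration only: the function names rscu
and rare_codons are hypothetical, and the counts used are those of the four glycine
codons in the codon usage table given in this specification.

```python
# Observed counts for the four glycine codons, copied from the highly
# expressed human table in this specification.
GLY_COUNTS = {"GGG": 905, "GGA": 525, "GGT": 441, "GGC": 1867}


def rscu(counts):
    """Observed count divided by the count expected under equal usage."""
    expected = sum(counts.values()) / len(counts)
    return {codon: n / expected for codon, n in counts.items()}


def rare_codons(counts, fraction_cutoff=0.20):
    """Codons making up less than `fraction_cutoff` of an amino acid's usage."""
    total = sum(counts.values())
    return sorted(c for c, n in counts.items() if n / total < fraction_cutoff)


gly_rscu = rscu(GLY_COUNTS)
print(rare_codons(GLY_COUNTS))        # ['GGA', 'GGT']
print(rare_codons(GLY_COUNTS, 0.10))  # []
print(sorted(c for c, v in gly_rscu.items() if v < 0.3))  # []
```

For glycine the 20% criterion marks GGA and GGT as rare, whereas no glycine codon has
an RSCU value below 0.3, so the two definitions need not select the same codons.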

A polynucleotide of the present invention will generally have a codon usage
coefficient for highly expressed human genes of greater than 0.3, preferably greater
than 0.4, most preferably greater than 0.5. Preferably also the codon usage coefficient
will be less than 1.0, preferably less than 0.9 and more preferably less than 0.8. Thus
a codon usage coefficient between 0.5 and 0.9 or between 0.5 and 0.8 is most
preferred. Codon usage tables for human can also be found in Genbank.

In comparison, a highly expressed beta actin gene has a codon usage coefficient of
0.747.



The codon usage table for Homo sapiens is set out below:

Codon usage for human (highly expressed) genes 1/24/91 (human_high.cod)

AmAcid  Codon     Number    /1000  Fraction

Gly     GGG       905.00    18.76      0.24
Gly     GGA       525.00    10.88      0.14
Gly     GGT       441.00     9.14      0.12
Gly     GGC      1867.00    38.70      0.50

Glu     GAG      2420.00    50.16      0.75
Glu     GAA       792.00    16.42      0.25
Asp     GAT       592.00    12.27      0.25
Asp     GAC      1821.00    37.75      0.75

Val     GTG      1866.00    38.68      0.64
Val     GTA       134.00     2.78      0.05
Val     GTT       198.00     4.10      0.07
Val     GTC       728.00    15.09      0.25

Ala     GCG       652.00    13.51      0.17
Ala     GCA       488.00    10.12      0.13
Ala     GCT       654.00    13.56      0.17
Ala     GCC      2057.00    42.64      0.53

Arg     AGG       512.00    10.61      0.18
Arg     AGA       298.00     6.18      0.10
Ser     AGT       354.00     7.34      0.10
Ser     AGC      1171.00    24.27      0.34

Lys     AAG      2117.00    43.88      0.82
Lys     AAA       471.00     9.76      0.18
Asn     AAT       314.00     6.51      0.22
Asn     AAC      1120.00    23.22      0.78

Met     ATG      1077.00    22.32      1.00
Ile     ATA        88.00     1.82      0.05
Ile     ATT       315.00     6.53      0.18
Ile     ATC      1369.00    28.38      0.77

Thr     ACG       405.00     8.40      0.15
Thr     ACA       373.00     7.73      0.14
Thr     ACT       358.00     7.42      0.14
Thr     ACC      1502.00    31.13      0.57

Trp     TGG       652.00    13.51      1.00
End     TGA       109.00     2.26      0.55
Cys     TGT       325.00     6.74      0.32
Cys     TGC       706.00    14.63      0.68

End     TAG        42.00     0.87      0.21
End     TAA        46.00     0.95      0.23
Tyr     TAT       360.00     7.46      0.26
Tyr     TAC      1042.00    21.60      0.74

Leu     TTG       313.00     6.49      0.06
Leu     TTA        76.00     1.58      0.02
Phe     TTT       336.00     6.96      0.20
Phe     TTC      1377.00    28.54      0.80

Ser     TCG       325.00     6.74      0.09
Ser     TCA       165.00     3.42      0.05
Ser     TCT       450.00     9.33      0.13
Ser     TCC       958.00    19.86      0.28

Arg     CGG       611.00    12.67      0.21
Arg     CGA       183.00     3.79      0.06
Arg     CGT       210.00     4.35      0.07
Arg     CGC      1086.00    22.51      0.37

Gln     CAG      2020.00    41.87      0.88
Gln     CAA       283.00     5.87      0.12
His     CAT       234.00     4.85      0.21
His     CAC       870.00    18.03      0.79

Leu     CTG      2884.00    59.78      0.58
Leu     CTA       166.00     3.44      0.03
Leu     CTT       238.00     4.93      0.05
Leu     CTC      1276.00    26.45      0.26

Pro     CCG       482.00     9.99      0.17
Pro     CCA       456.00     9.45      0.16
Pro     CCT       568.00    11.77      0.19
Pro     CCC      1410.00    29.23      0.48

WO 2004/041852    PCT/EP2003/012402

According to a further aspect of the invention, an expression vector is provided which
comprises and is capable of directing the expression of a polynucleotide sequence
according to the first aspect of the invention, in particular wherein the codon usage
pattern of the gp120 polynucleotide sequence is typical of highly expressed
mammalian genes, preferably highly expressed human genes. The vector may be
suitable for driving expression of heterologous DNA in bacterial, insect or mammalian
cells, particularly human cells. In one embodiment, the expression vector is p7313
(see Figure 1).

In a further aspect the invention provides a method of treating or preventing HIV
infections, or any symptoms or diseases associated therewith, comprising administering a
safe and effective amount of a polynucleotide, a vector, a polypeptide or a
pharmaceutical composition according to the invention.

Administration of the pharmaceutical composition may take the form of one or of
more than one individual doses, for example as repeat doses of the same DNA
plasmid, or in a heterologous "prime-boost" vaccination regime, particularly a
therapeutic vaccination regime. A heterologous prime-boost regime uses
administration of different forms of vaccine in the prime and the boost, each of which
may itself include two or more administrations. Preferably but not necessarily the
priming and boosting compositions comprise the same antigens or different forms of
the same antigens. The priming composition and the boosting composition will
in any case have at least one antigen in common, although it is not necessarily an
identical form of the antigen; it may be a different form of the same antigen. An
example of different forms of the same antigen is the case of a polynucleotide
encoding a gp120 which lacks a functional signal sequence and is substantially
non-glycosylated in mammalian cells, and a polypeptide which is gp120 with its signal
sequence and which is glycosylated. A full length and a truncated version of the same
protein, or a mutated and a non-mutated form of the same protein, may also be
considered different forms of the same antigen for the purposes of a prime-boost
format according to the invention.

In one example of a prime-boost regime the "prime" vaccination may be via particle
mediated DNA delivery of a priming composition which comprises a polynucleotide
or set of polynucleotides according to the present invention, preferably incorporated
into a plasmid vector, while the "boost" administration may be of a boosting
composition comprising a recombinant viral vector comprising the same
polynucleotide sequence or a polynucleotide encoding at least one of the same
antigens encoded by the priming composition. Alternatively the boosting may be
carried out with at least one of the same antigens in the form of the protein in
adjuvant. Conversely the priming may be with a priming composition comprising the
viral vector or with a protein formulation, typically a protein formulated in adjuvant,
and the boost with a DNA vaccine of the present invention.

A preferred prime-boost format for use with the polynucleotides according to the 
present invention is selected from: 

Protein prime / live vector boost 

Live vector prime / protein boost 

Protein prime / DNA plasmid boost 

DNA plasmid prime / protein boost 

Live vector prime / DNA plasmid boost 

DNA plasmid prime / live vector boost 
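
The six formats listed above are the ordered pairs of two different forms of vaccine
drawn from protein, live vector and DNA plasmid. Purely as an illustration, this can
be enumerated in Python; the names MODALITIES and FORMATS are hypothetical and are
used here only for the purposes of the sketch.

```python
from itertools import permutations

# The three forms of vaccine named in the list above.
MODALITIES = ("protein", "live vector", "DNA plasmid")

# A heterologous regime uses different forms for the prime and the boost,
# so the formats are the ordered pairs of two distinct modalities.
FORMATS = list(permutations(MODALITIES, 2))

for prime, boost in FORMATS:
    print(f"{prime} prime / {boost} boost")
```

The sketch prints six lines, one for each of the formats set out above.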

Preferred live vectors include live virus vectors in particular adenovirus vectors as 
described herein. 

Preferably the priming and boosting compositions comprise, in addition to the 
antigen(s) a suitable adjuvant, which may be different according to the particular 
composition. 

Both the priming composition and the boosting composition may be delivered in more
than one dose. Furthermore the initial priming and boosting doses may be followed
up with further doses which may be alternated to result in e.g. a DNA plasmid prime /
protein boost / further DNA plasmid dose / further protein dose.

The invention further provides a process for the production of a polynucleotide as
described herein comprising linking a nucleotide sequence encoding a substantially
non-glycosylated HIV envelope molecule, preferably gp120 or fragment or
immunogenic derivative thereof and optionally a sequence encoding an additional
HIV protein for example an HIV non-structural protein such as Tat or an HIV capsid
protein such as Gag, or fragments or immunogenic derivatives thereof, to a
heterologous promoter sequence.

As discussed above, the present invention includes expression vectors that comprise 
the nucleotide sequences of the invention. Such expression vectors are routinely 
constructed in the art of molecular biology and may for example involve the use of 
plasmid DNA and appropriate initiators, promoters, enhancers and other elements, 
such as for example polyadenylation signals which may be necessary, and which are 
positioned in the correct orientation, in order to allow for protein expression. Other 
suitable vectors and how to construct them would be apparent to persons skilled in the 
art. By way of further example in this regard we refer to Sambrook et al. Molecular 
Cloning: a Laboratory Manual. 2nd Edition. CSH Laboratory Press. (1989).

Preferably, a polynucleotide of the invention, or for use in the invention in a vector, is 
operably linked to a control sequence which is capable of providing for the expression 
of the coding sequence by the host cell, i.e. the vector is an expression vector. The 
term "operably linked" refers to a juxtaposition wherein the components described are 
in a relationship permitting them to function in their intended manner. A regulatory 
sequence, such as a promoter, "operably linked" to a coding sequence is positioned in 
such a way that expression of the coding sequence is achieved under conditions 
compatible with the regulatory sequence. 

A nucleic acid sequence of the present invention may be administered by means of
specialised delivery vectors useful in gene therapy. Gene therapy approaches are
discussed for example by Verma et al., Nature 1997, 389:239-242. Both viral and
non-viral vector systems can be used. The vectors may be, for example, plasmids,
artificial chromosomes (e.g. BAC, PAC, YAC), virus or phage vectors provided with
an origin of replication, optionally a promoter for the expression of the polynucleotide
and optionally a regulator of the promoter. The vectors may contain one or more
selectable marker genes, for example an ampicillin or kanamycin resistance gene in
the case of a bacterial plasmid or a resistance gene for a fungal vector. Vectors may
be used in vitro, for example for the production of DNA or RNA or used to transfect
or transform a host cell, for example, a mammalian host cell e.g. for the production of
protein encoded by the vector. The vectors may also be adapted to be used in vivo, for
example in a method of DNA vaccination or of gene therapy.



Examples of suitable viral vectors include retroviral, lentiviral, adenoviral,
adeno-associated viral, herpes viral including herpes simplex viral, alpha-viral, pox viral
such as Canarypox and vaccinia-viral based systems. Gene transfer techniques using
these viruses are known to those skilled in the art. Retrovirus vectors for example
may be used to stably integrate the polynucleotide of the invention into the host
genome, although such recombination is not preferred. Replication-defective
adenovirus vectors by contrast remain episomal and therefore allow transient
expression. Vectors capable of driving expression in insect cells (for example
baculovirus vectors), in human cells, yeast or in bacteria may be employed in order to
produce quantities of the HIV protein encoded by the polynucleotides of the present
invention, for example for use as subunit vaccines or in immunoassays.

In a preferred embodiment the adenovirus used as a live vector is a replication
defective simian adenovirus. Typically these viruses contain an E1 deletion and can
be grown on cell lines that are transformed with an E1 gene. Preferred simian
adenoviruses are viruses isolated from chimpanzee. In particular C68 (also known as
Pan 9) (see US Patent No. 6,083,716) and Pan 5, 6 and Pan 7 (WO 03/046124) are
preferred for use in the present invention. Thus these vectors can be manipulated to
insert a heterologous gene of the invention such that the gene product may be
expressed. The use, formulation and manufacture of such recombinant adenoviral
vectors is set forth in detail in WO 03/046142.

Promoters and other expression regulation signals may be selected to be compatible
with the host cell for which expression is designed. For example, mammalian
promoters include the metallothionein promoter which can be induced in response to
heavy metals such as cadmium, and the β-actin promoter. Viral promoters such as the
SV40 large T antigen promoter, human cytomegalovirus (CMV) immediate early (IE)
promoter, Rous sarcoma virus LTR promoter, adenovirus promoter, or an HPV
promoter, particularly the HPV upstream regulatory region (URR), may also be used.
All these promoters are well described and readily available in the art.

A preferred promoter element is the CMV immediate early promoter devoid of intron
A, but including exon 1. Accordingly there is provided a vector comprising a
polynucleotide of the invention under the control of the HCMV IE early promoter. A
suitable HCMV IE promoter is described in WO 02/36792.

Non-viral based systems include direct administration of nucleic acids, microsphere 
encapsulation technology, poly(lactide-co-glycolide) and liposome-based systems. 

The polynucleotides according to the invention have utility in the production by
expression of the encoded proteins, which expression may take place in vitro, in vivo
or ex vivo. The nucleotides may therefore be involved in recombinant protein
synthesis, for example to increase yields, or indeed may find use as therapeutic agents
in their own right, utilised in DNA vaccination techniques. Where the polynucleotides
of the present invention are used in the production of the encoded proteins in vitro or
ex vivo, cells, for example in cell culture, will be modified to include the
polynucleotide to be expressed. Such cells include transient, or preferably stable
mammalian cell lines. Particular examples of cells which may be modified by
insertion of vectors encoding for a polypeptide according to the invention include
mammalian HEK293T, CHO, HeLa, 293 and COS cells. Preferably the cell line
selected will be one which is stable. Expression may be achieved in transformed
oocytes. A polypeptide may be expressed from a polynucleotide of the present
invention, in cells of a transgenic non-human animal, preferably a mouse. A
transgenic non-human animal expressing a polypeptide from a polynucleotide of the
invention is included within the scope of the invention.

The invention further provides a method of vaccinating a mammalian subject which
comprises administering thereto an effective amount of a vaccine or vaccine
composition according to the invention. Preferably, expression vectors for use in
DNA vaccines, vaccine compositions and immunotherapeutics will be plasmid vectors
or live viral vectors.

DNA vaccines may be administered in the form of "naked DNA", for example in a
liquid formulation administered using a syringe or high pressure jet, or DNA
formulated with liposomes or an irritant transfection enhancer, or by particle mediated
DNA delivery (PMDD, or particle mediated immunotherapeutic delivery, PMID) as
described in more detail herein. All of these delivery systems are well known in the
art. The vector may be introduced to a mammal for example by means of a viral
vector delivery system.

The compositions of the present invention can be delivered by a number of routes
such as intramuscularly, subcutaneously, intraperitoneally, intravenously or mucosally.

The invention further provides an intradermal delivery device comprising a 
pharmaceutical composition described herein. 

In a preferred embodiment, the composition is delivered intradermally. In particular,
the composition is delivered by means of a gene gun, particularly using particle
bombardment administration techniques which involve coating the vector on to beads
(e.g. gold beads) which are then administered under high pressure into the epidermis.
This is described, for example, in Haynes et al., J Biotechnology 44: 37-42 (1996).

Numerous methods of carrying out a particle bombardment approach are known, see
for example WO 91/07487. In one illustrative example, gas-driven particle
acceleration can be achieved with devices such as those manufactured by Powderject
Pharmaceuticals PLC (Oxford, UK) and Powderject Vaccines Inc. (Madison, WI),
some examples of which are described in U.S. Patent Nos. 5,846,796; 6,010,478;
5,865,796; 5,584,807; and EP Patent No. 0500 799. This approach offers a
needle-free delivery approach wherein a dry powder formulation of microscopic particles,
such as polynucleotide, are accelerated to high speed within a helium gas jet generated
by a hand held device, propelling the particles into a target tissue of interest, typically
the skin. The particles are preferably gold beads of 0.4 - 4.0 μm, more preferably
0.6 - 2.0 μm, diameter, and the DNA conjugate is coated onto these and then encased in a
cartridge or cassette for placing into the delivery device.

In a related embodiment, other devices and methods that may be useful for gas-driven
needle-less injection of compositions of the present invention include those provided 
by Bioject, Inc. (Portland, OR), some examples of which are described in U.S. Patent 
Nos. 4,790,824; 5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 and 
5,993,412. 

The vectors which comprise the nucleotide sequences encoding antigenic peptides are 
administered in such amount as will be prophylactically or therapeutically effective. 
The quantity of nucleotide to be administered per dose is generally in the range of one
picogram to 1 milligram, preferably 1 picogram to 10 micrograms for particle-mediated
delivery, and 100 nanograms to 10 milligrams for other routes. The exact
quantity may vary considerably depending on the weight of the patient being 
immunised and the route of administration. 

It is possible for the immunogen component comprising the nucleotide sequence 
encoding the antigenic peptide, to be administered on a one off basis or to be 
administered repeatedly, for example, between 1 and 7 times, preferably between 1 
and 4 times, at intervals between about 1 day and about 18 months. Further 
administrations may also be given as necessary to maintain immune responses for the 
lifetime of the patient. However, this treatment regime will be significantly varied 
depending upon the size of the patient concerned, the amount of nucleotide sequence 
administered, the route of administration, and other factors which would be apparent 
to a skilled medical practitioner. The patient may receive one or more other anti-HIV
retroviral drugs as part of their overall treatment regime. Additionally the nucleic acid
immunogen may be administered with an adjuvant.

The adjuvant component specified herein can similarly be administered via a variety
of different administration routes, such as for example, via the oral, nasal, pulmonary,
intramuscular, subcutaneous, intradermal or topical routes. Preferably, the adjuvant
component is administered via the intradermal or topical route, most preferably by a
topical route. This administration may take place between about 14 days prior to and
about 14 days post administration of the nucleotide sequence, preferably between
about 1 day prior to and about 3 days post administration of the nucleotide sequence.

The adjuvant component is, in one embodiment, administered substantially 
simultaneously with the administration of the nucleotide sequence. By "substantially 
simultaneous" what is meant is that administration of the adjuvant component is 
preferably at the same time as administration of the nucleotide sequence, or if not, it is 
at least within a few hours either side of nucleotide sequence administration. In the 
most preferred treatment protocol, the adjuvant component will be administered 
substantially simultaneously with administration of the nucleotide sequence. 
Obviously, this protocol can be varied as necessary, in accordance with the type of 
variables referred to above. It is preferred that the adjuvant is a
1H-imidazo[4,5-c]quinolin-4-amine derivative such as imiquimod. Typically imiquimod will be
presented as a topical cream formulation and will be administered according to the 
above protocol. 

Once again, depending upon such variables, the dose of administration of the
derivative will also vary, but may, for example, range between about 0.1 mg per kg to
about 100 mg per kg, where "per kg" refers to the body weight of the mammal
concerned. This administration of the 1H-imidazo[4,5-c]quinolin-4-amine derivative
would preferably be repeated with each subsequent or booster administration of the
nucleotide sequence. Most preferably, the administration dose will be between about
1 mg per kg to about 50 mg per kg. In the case of a "prime-boost" scheme as
described herein, the imiquimod or other 1H-imidazo[4,5-c]quinolin-4-amine
derivative may be administered with either the prime or the boost or with both the
prime and the boost.

While it is possible for the adjuvant component to comprise only
1H-imidazo[4,5-c]quinolin-4-amine derivatives to be administered in the raw chemical state, it is
preferable for administration to be in the form of a pharmaceutical formulation. That
is, the adjuvant component will preferably comprise the
1H-imidazo[4,5-c]quinolin-4-amine combined with one or more pharmaceutically acceptable carriers, and
optionally other therapeutic ingredients. The carrier(s) must be "acceptable" in the
sense of being compatible with other ingredients within the formulation, and not
deleterious to the recipient thereof. The nature of the formulations will naturally vary
according to the intended administration route, and may be prepared by methods well
known in the pharmaceutical art. All methods include the step of bringing into
association a 1H-imidazo[4,5-c]quinolin-4-amine derivative with an appropriate
carrier or carriers. In general, the formulations are prepared by uniformly and
intimately bringing into association the derivative with liquid carriers or finely divided
solid carriers, or both, and then, if necessary, shaping the product into the desired
formulation. Formulations of the present invention suitable for oral administration
may be presented as discrete units such as capsules, cachets or tablets each containing
a pre-determined amount of the active ingredient; as a powder or granules; as a
solution or a suspension in an aqueous liquid or a non-aqueous liquid; or as an
oil-in-water liquid emulsion or a water-in-oil emulsion. The active ingredient may also be
presented as a bolus, electuary or paste.

A tablet may be made by compression or moulding, optionally with one or more
accessory ingredients. Compressed tablets may be prepared by compressing in a
suitable machine the active ingredient in a free-flowing form such as a powder or
granules, optionally mixed with a binder, lubricant, inert diluent, lubricating, surface
active or dispersing agent. Moulded tablets may be made by moulding in a suitable
machine a mixture of the powdered compound moistened with an inert liquid diluent.

The tablets may optionally be coated or scored and may be formulated so as to provide
slow or controlled release of the active ingredient.

Formulations for injection via, for example, the intramuscular, intraperitoneal, 
intradermal,or subcutaneous administration routes include aqueous and non-aqueous 
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sterile injection solutions which may contain antioxidants, buffers, bacteriostats and 
solutes which render the formulation isotonic with the blood of the intended recipient; 
and aqueous and non-aqueous sterile suspensions which may include suspending 
agents and thickening agents. The formulations may be presented in unit-dose or 
multi-dose containers, for example, sealed ampoules and vials, and may be stored in a 
freeze-dried (lyophilised) condition requiring only the addition of the sterile liquid 
carrier, for example, water for injections, immediately prior to use. Extemporaneous 
injection solutions and suspensions maybe prepared from sterile powders, granules 
and tablets of the kind previously described. Formulations suitable for pulmonary 
administration via the buccal or nasal cavity are presented such that particles 
containing the active ingredient, desirably having a diameter in the range of 0.5 to 7 
microns, are delivered into the bronchial tree of the recipient. Possibilities for such 
formulations are that they are in the form of finely comminuted powders which may 
conveniently be presented either in a pierceable capsule, suitably of, for example,
gelatine, for use in an inhalation device, or alternatively, as a self-propelling 
formulation comprising active ingredient, a suitable liquid propellant and optionally, 
other ingredients such as surfactant and/or a solid diluent. Self-propelling 
formulations may also be employed wherein the active ingredient is dispensed in the 
form of droplets of a solution or suspension. Such self-propelling formulations are 
analogous to those known in the art and may be prepared by established procedures. 
They are suitably provided with either a manually-operable or automatically 
functioning valve having the desired spray characteristics; advantageously the valve is 
of a metered type delivering a fixed volume, for example, 50 to 100 µL, upon each
operation thereof. 

In a further possibility, the adjuvant component may be in the form of a solution for 
use in an atomiser or nebuliser whereby an accelerated airstream or ultrasonic 
agitation is employed to produce a fine droplet mist for inhalation.

Formulations suitable for intranasal administration generally include presentations 
similar to those described above for pulmonary administration, although it is preferred 
for such formulations to have a particle diameter in the range of about 10 to about 200 
microns, to enable retention within the nasal cavity. This may be achieved by, as
appropriate, use of a powder of a suitable particle size, or choice of an appropriate
valve. Other suitable formulations include coarse powders having a particle diameter
in the range of about 20 to about 500 microns, for administration by rapid inhalation
through the nasal passage from a container held close up to the nose, and nasal drops
comprising about 0.2 to 5% w/w of the active ingredient in aqueous or oily solutions.

In one embodiment of the invention, it is possible for the vector which comprises the
nucleotide sequence encoding the antigenic peptide to be administered within the
same formulation as the 1H-imidazo[4,5-c]quinolin-4-amine derivative. Hence in this
embodiment, the immunogenic and the adjuvant component are found within the same
formulation.

In one embodiment the adjuvant component is prepared in a form suitable for biolistic 
administration, and is administered via that route substantially simultaneously with 
administration of the nucleotide sequence. For preparation of formulations suitable 
for use in this manner, it may be necessary for the 1H-imidazo[4,5-c]quinolin-4-amine
derivative to be lyophilised and adhered onto, for example, particles such as gold 
beads which are suited for biolistic administration. 

In an alternative embodiment, the adjuvant component may be administered as a dry 
powder, via high pressure gas propulsion.

Even if not formulated together, it may be appropriate for the adjuvant component to 
be administered at or about the same administration site as the nucleotide sequence. 

Other details of pharmaceutical preparations can be found in Remington's
Pharmaceutical Sciences, Mack Publishing Company, Easton, Pennsylvania (1985),
the disclosure of which is included herein in its entirety, by way of reference.

Suitable techniques for introducing the naked polynucleotide or vector into a patient 
also include topical application with an appropriate vehicle. The nucleic acid may be
administered topically to the skin, or to mucosal surfaces for example by intranasal,
oral, intravaginal or intrarectal administration. The naked polynucleotide or vector
may be present together with a pharmaceutically acceptable excipient, such as
phosphate buffered saline (PBS). DNA uptake may be further facilitated by use of
facilitating agents such as bupivacaine, either separately or included in the DNA
formulation. Other methods of administering the nucleic acid directly to a recipient
include ultrasound, electrical stimulation, electroporation and microseeding which is
described in US 5,697,901.

Uptake of nucleic acid constructs may be enhanced by several known transfection 
techniques, for example those including the use of transfection agents. Examples of 
these agents include cationic agents, for example calcium phosphate and
DEAE-Dextran, and lipofectants, for example lipofectam and transfectam. The dosage of the
nucleic acid to be administered can be altered. 



A nucleic acid sequence of the present invention may also be administered by means
of specialised delivery vectors useful in gene therapy. Gene therapy approaches are
discussed for example by Verma et al, Nature 1997, 389:239-242. Both viral and
non-viral vector systems can be used and are described above. Viral and non-viral
delivery systems may be combined where it is desirable to provide booster injections
after an initial vaccination, for example an initial "prime" DNA vaccination using a
non-viral vector such as a plasmid followed by one or more "boost" vaccinations
using a viral vector or non-viral based system. Similarly the invention contemplates
prime boost systems with the polynucleotide of the invention, followed by boosting
with protein in adjuvant or vice versa.

A nucleic acid sequence of the present invention may also be administered by means
of transformed cells. Such cells include cells harvested from a subject. The naked
polynucleotide or vector of the present invention can be introduced into such cells in
vitro and the transformed cells can later be returned to the subject. The
polynucleotide of the invention may integrate into nucleic acid already present in a
cell by homologous recombination events. A transformed cell may, if desired, be
grown up in vitro and one or more of the resultant cells may be used in the present
invention. Cells can be provided at an appropriate site in a patient by known surgical
or microsurgical techniques (e.g. grafting, micro-injection, etc.).

The pharmaceutical compositions of the present invention may include adjuvant
compounds as detailed above, or other substances which may serve to increase the
immune response induced by the protein which is encoded by the DNA. These may be
encoded by the DNA, either separately from or as a fusion with the antigen, or may be
included as non-DNA elements of the formulation. Examples of adjuvant-type
substances which may be included in the formulations of the present invention include
ubiquitin, lysosomal associated membrane protein (LAMP), hepatitis B virus core
antigen, FLT3-ligand (a cytokine important in the generation of professional antigen
presenting cells, particularly dendritic cells) and other cytokines such as IFN-γ and
GM-CSF. Other preferred adjuvants include imiquimod and resiquimod and
tucaresol, imiquimod being particularly preferred.

In a particular embodiment of the invention there is provided the use of a nucleic acid 
molecule as herein described for the treatment or prophylaxis of HIV infection, 
administered with imiquimod. The imiquimod is preferably administered topically,
whereas the nucleic acid molecule is preferably administered by means of particle 
mediated delivery. 

Accordingly the present invention also provides a method of treating a subject 
suffering from or susceptible to HIV infection, comprising administering a nucleic
acid molecule as herein described and imiquimod. 

The present invention will now be described by reference to the following examples:

EXAMPLES

Example 1: Plasmid Construction

1.1 Construction of gp120 containing plasmid

Recombinant gp120 glycoprotein described in the following examples is a synthetic
form of the gp120 envelope protein of HIV-1 isolate W61D.

Codon Optimised (pgp120c):

The gene sequence was based on the gp120 sequence from the HIV-1 isolate W61D.
This has a Codon Usage Coefficient (CUC) of 0.297. Optimisation was performed using
SynGene 2d, resulting in a CUC of 0.749 (Ertl, PF., Thomsen, LL. Technical issues in
construction of nucleic acid vaccines. (2003) Methods 31(3); 199-206). SynGene
uses a mathematical method for codon optimisation based on the relative frequencies
of use. Briefly, codons are assigned value ranges according to their frequencies, so
that more frequent codons have wider ranges, and placed in ascending frequency
order. The value ranges are expressed as >=0.000, >=0.0??, >=0.??? and so on. A
random number is generated between 0 and 0.99999. This is then used to select a
codon, which will be the codon allocated the range within which the random number
falls. To exclude rare codons the value 0.1 is added to the random number, so that it
falls in the range 0.1-1.09999.
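The range-based selection described above can be illustrated with a minimal sketch. It is not SynGene itself: the leucine codon frequencies are hypothetical illustrative values, the function names are invented for this example, and it assumes that a shifted random number of 1.0 or more falls into the open-ended range of the most frequent codon.

```python
import random
from bisect import bisect_right

# Hypothetical relative usage frequencies for the six leucine codons.
# Illustrative values only; they are not taken from SynGene or the source.
LEU_FREQS = {"CTA": 0.07, "TTA": 0.07, "TTG": 0.13,
             "CTT": 0.13, "CTC": 0.20, "CTG": 0.40}


def build_ranges(freqs):
    """Order codons by ascending frequency and return their lower bounds."""
    ordered = sorted(freqs.items(), key=lambda item: item[1])
    lower_bounds, codons, running = [], [], 0.0
    for codon, freq in ordered:
        lower_bounds.append(running)  # this codon's range is ">= running"
        codons.append(codon)
        running += freq               # wider range for more frequent codons
    return lower_bounds, codons


def pick_codon(freqs, r, rare_offset=0.1):
    """Select the codon whose range contains r + rare_offset."""
    lower_bounds, codons = build_ranges(freqs)
    index = bisect_right(lower_bounds, r + rare_offset) - 1
    return codons[index]


def optimise_leucines(count, rng=random.random):
    """Draw one codon per leucine position (assumed helper for illustration)."""
    return [pick_codon(LEU_FREQS, rng()) for _ in range(count)]
```

With these assumed frequencies the rarest codon occupies the range 0.00 to 0.07, so adding 0.1 to the random number means it can never be selected, while the most frequent codon absorbs every value of 0.60 and above.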

The gp120 sequence was split into 40 overlapping oligonucleotides, PCR assembled
and recovered using the end primers. The gene was cloned into vector p7313-ie
(shown in Figure 1) as a NotI-BamHI fragment and sequenced. Restriction fragments
from three initial clones were combined to generate a single correct clone. The amino
acid sequence and codon optimised DNA sequence are given in Figure 2.

1.2 Generation of Nef/Tat containing plasmids

Nef/Tat (pNTm and ptrNTm)

The gene for the Nef/Tat fusion protein was provided in plasmid pRIT15244 (Figure
3). The plasmid pRIT15244 is identical to pRIT14913 described below except that
the His tail has been deleted.

General

The Nef gene from the Bru/Lai isolate (Cell 40: 9-17, 1985) was selected for the
constructs since this gene is among those that are most closely related to the
consensus Nef.

The starting material for the Bru/Lai Nef gene was a 1170bp DNA fragment cloned on
the mammalian expression vector pcDNA3 (pcDNA3/Nef).

The Tat gene originates from the BH10 molecular clone. This gene was received as
an HTLV III cDNA clone named pCV1 and described in Science, 229, p69-73, 1985.
This tat gene bears mutations in the active site region (Lys41→Ala) and in RGD motif
(Arg78→Lys and Asp80→Glu) (Virology 235: 48-64, 1997).

The mutant tat gene was received as a cDNA fragment subcloned between the EcoRI
and HindIII sites within a CMV expression plasmid (pCMVLys41/KGE).

Construction of vector pRIT14597 (encoding Nef-His protein).

The nef gene was amplified by PCR from the pcDNA3/Nef plasmid with primers 01
and 02.

PRIMER 01 (NcoI): 5' ATCGTCCATG.GGT.GGC.AAG.TGG.T 3' [SEQ ID NO: 1]

PRIMER 02 (SpeI): 5' CGGCTACTAGTGCAGTTCTTGAA 3' [SEQ ID NO: 2]



WO 2004/041852 



PCT/EP2003/012402 



The integrative vector PHIL-D2 (INVITROGEN) was used. This vector was modified
in such a way that expression of heterologous protein starts immediately after the
native ATG codon of the AOX1 gene and will produce recombinant protein with a tail
of one glycine and six histidine residues. This PHIL-D2-MOD vector was
constructed by cloning an oligonucleotide linker between the adjacent AsuII and
EcoRI sites of PHIL-D2 vector. In addition to the His tail, this linker carries NcoI,
SpeI and XbaI restriction sites between which nef, tat and nef-tat fusion were inserted.

The nef PCR fragment obtained and the integrative PHIL-D2-MOD vector were both
restricted by NcoI and SpeI, purified on agarose gel and ligated to create the
integrative plasmid pRIT14597.

Construction of vector pRIT14913 (encoding fusion Nef-Tat mutant-His)

To construct pRIT14913, the tat mutant gene was amplified by PCR from the
pCMVLys41/KGE plasmid with primers 03 and 04.

PRIMER 03 (SpeI): 5' ATCGTACTAGT.GAG.CCA.GTAGAT.C 3' [SEQ ID NO: 3]

PRIMER 04 (SpeI): 5' CGGCTACTAGTTTCCTTCGGGCCT 3' [SEQ ID NO: 4]


The PCR fragment obtained and the plasmid pRIT14597 (expressing Nef-His protein)
were both digested by SpeI restriction enzyme, purified on agarose gel and ligated to
create the integrative plasmid pRIT14913.

1.3 Generation of PMID vectors for gp120 and Nef/Tat:

gp120: Codon-optimised gp120 was provided as described above.

Nef/Tat (pNTm and ptrNTm):

The gene for the Nef/Tat fusion protein was provided in plasmid pRIT15244
described above. The Tat in this plasmid contains three mutations to inactivate the
transactivation function. The fusion contains full length Nef which has an immune
modulatory function (Collins and Baltimore (1999)) that may be abrogated by
N-terminal truncation. Therefore constructs were generated for both full length
Nef/mutant Tat (pNTm) and truncated Nef/mutant Tat (ptrNTm), in which the first 65
amino acids of Nef were removed. These sequences were PCR amplified from
pRIT15244 using primers:

5'Nef GAATTCGCGGCCGCCATGGGTGGCAAGTGGTCAAAAAG
5'trNef GAATTCGCGGCCGCCATGGTGGGTTTTCCAGTCACACC
3'Tat GAATTCGGATCCTTATTCCTTCGGGCCTGTCGGG
[SEQ ID NOS: 5, 6, 7]

The genes were cloned into vector p7313-ie as NotI-BamHI fragments and sequenced.
pNTm and ptrNTm and the Nef/Tat and truncated Nef/Tat sequences are shown in
Figures 4 and 5.
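Since the amplified genes are cloned as NotI-BamHI fragments, the three primers can be checked for the relevant recognition sequences. The following is a minimal sketch; the primer sequences are those listed above written without extraction spaces, the recognition sequences are the standard ones for EcoRI, NotI and BamHI, and the dictionary and function names are chosen for this illustration only.

```python
# Standard recognition sequences; all three are palindromic, so scanning
# one strand of the primer is sufficient.
SITES = {"EcoRI": "GAATTC", "NotI": "GCGGCCGC", "BamHI": "GGATCC"}

# Primers of SEQ ID NOS: 5, 6 and 7 as given in the text.
PRIMERS = {
    "5'Nef": "GAATTCGCGGCCGCCATGGGTGGCAAGTGGTCAAAAAG",
    "5'trNef": "GAATTCGCGGCCGCCATGGTGGGTTTTCCAGTCACACC",
    "3'Tat": "GAATTCGGATCCTTATTCCTTCGGGCCTGTCGGG",
}


def sites_in(primer_seq):
    """Return the names of the recognition sites found in a primer."""
    return [name for name, site in SITES.items() if site in primer_seq]


# Map each primer name to the sites it carries.
PRIMER_SITES = {name: sites_in(seq) for name, seq in PRIMERS.items()}
```

Under these assumptions both 5' primers carry a NotI site and the 3' primer carries a BamHI site, which is consistent with cloning the products as NotI-BamHI fragments.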

Dual expression vectors: (pRIX1 and pRIX2)

The Nef/Tat and trNef/Tat expression cassettes were excised as ClaI-XmnI restriction
fragments, and ligated into the ClaI and blunted Sse8387 I sites of the vector
containing the codon optimised gp120 (pgp120c) to provide single plasmids for
expression of both proteins (pRIX1 and pRIX2 respectively).

Composition of plasmid p7313-ie (Figure 1)

The plasmid was constructed by replacing the beta-lactamase gene containing
Eam1105I - PstI fragment of pUC19 (available from Amersham Pharmacia Biotech
UK Ltd., Amersham Place, Little Chalfont, Bucks, HP7 9NA) with an EcoRI
fragment of pUC4K (Amersham-Pharmacia) containing the Kanamycin resistance
gene, following blunt ending of both fragments using T4 DNA polymerase. The
human Cytomegalovirus IE1 promoter/enhancer, Intron A, was derived from plasmid
JW4303 obtained from Dr Harriet Robinson, University of Massachusetts, and
inserted into the SalI site of pUC19 as a XhoI-SalI fragment, incorporating the
bovine growth hormone polyadenylation signal. Deletion of the 5' SalI-BanI fragment
from the promoter generated the minimal promoter used in the vector (WO00/23592 -
Powderject Vaccines Inc.). HBV Surface antigen 3'UTR was derived from Hepatitis B
Virus, serotype adw, in the vector pAM6 (Moriarty et al., Proc.Natl.Acad.Sci. USA,
78, 2606-10, 1981). pAM6 (pBR322 based vector) was obtained from the American
Type Culture Collection, catalogue number ATCC 45020. The 3'UTR was inserted 5'
to the polyadenylation signal as a 1.4kb BamHI fragment, blunt ended for insertion to
remove the BamHI sites. In a series of steps (including digestion with Bgl II, Klenow
polymerase treatment, digestion with BstX I, digestion with Nco I, treatment with
mung bean nuclease to remove overhang and further digestion with BstX I),
modifications were made to the region between the 3' untranslated enhancer region of
the HBV S gene and bGHpA signal to remove all open reading frames of greater than
5 codons between the X gene promoter and the bGHpA signal. This resulted in
deletion of sequence encoding the translatable portion of the X protein (9 amino acids)
and the X gene start codon. The bovine growth hormone polyadenylation signal was
substituted with the rabbit beta globin polyadenylation signal. The 5' non-coding and
coding sequences of the S antigen were excised and replaced with an oligonucleotide
linker to provide multiple cloning sites as shown to produce plasmid p7313-PL.

HindIII - NotI - NheI - EcoRV - KpnI - NdeI - SalI - BamHI - ΔNotI

AGCTTGCGGCCGCTAGCGATATCGGTACCATATGTCGACGGATCC
    ACGCCGGCGATCGCTATAGCCATGGTCTACAGCTGCCTAGGCCGG

[SEQ ID NO: 8]


This polylinker was further extended by insertion of an additional oligonucleotide 
linker between the KpnI and SalI sites:

Asp I -- MunI -- NaeI -- NdeI -- BglII

GTACCGGTCAATTGGCGCCGGCGCGCCATATGACGTCAGATCTG
    GCCAGTTAACCGCGGCCGCGCGGTATACTGCAGTCTAGACAGCT

-- AgeI -- NarI -- AatII -- SalI

[SEQ ID NO: 9]

The ColE1 cer sequence was obtained from a subclone from plasmid pDAH212 from
David Hodgeson (Warwick University) and amplified by PCR using primers to place
EcoRI restriction sites at the ends of the sequence. The cer sequence was then inserted
into the EcoRI site of p7313-PL to produce plasmid p7313-PLc. The sequence of the
amplified cer was verified against the Genbank entry M11411.

The HBV 3'UTR sequence between the promoter and polyadenylation signal was
removed by PCR amplification of the polyadenylation signal using primers:

sense: CCATGGATCCGATCTTTTTCCCTCTGCC [SEQ ID NO: 10]
antisense: GTTAGGGTGAAAAGCTTCCGAGTGAGAGACAC [SEQ ID NO: 11]

The resulting product was cut with BamHI and XmnI and used to replace the
corresponding fragment containing both the polyadenylation signal and the 3'UTR.

The Intron A sequence was removed from the plasmid by PCR amplification of the
CMV promoter/enhancer using primers:

sense: GCTAGCCTGCAGGCTGACCGCCCAACGAC [SEQ ID NO: 12]
antisense: GTTCTCCATCGCGGCCGCACTCTTGGCACGGGG [SEQ ID NO: 13]

The resulting product was cut with Sse8387 I and NotI, and inserted back into the
Sse8387 I and NotI sites of the parental vector.



Example 2: Modification of gp120 and Nef/Tat(mut) expression vectors

gp120 constructs were modified to reduce secretion of the protein.

Generation of constructs:

gp120 without a secretion signal (dsgp120, pRix12 - see Figures 2 and 6)

The gp120 gene was PCR amplified from pgp120c using the following primers:

5'120ds: 5' GAATTCGCGGCCGCCATGGCCGAGCAGCTGTGGGTCACC
[SEQ ID NO: 14]

L01:
5' GAATTCGGATCCTCATCTCTGCACGACGCGGCGCTTGGCCCGGGT
GGGGGCCACG [SEQ ID NO: 15]

Fragments were amplified using PWO DNA polymerase (Roche) and the cycle:

95°C (30s); 20 x [95°C (30s), 50°C (30s), 72°C (90s)]; 72°C (120s); 4°C (hold)

The products were cut with NotI and BamHI and cloned into p7313-ie to give pRix12
(Figure 6).
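The thermal cycle quoted in this example can be written down as a small data structure, which makes the programmed run time easy to check. This is only a sketch: the class and function names are assumptions made for illustration, and the total excludes ramp times and the final 4°C hold.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    """One programmed thermocycler step."""
    temp_c: int
    seconds: int


# Programme from Example 2: initial denaturation, 20 repeats of
# denature/anneal/extend, then a final extension (4°C hold not timed).
INITIAL = Step(95, 30)
CYCLE = [Step(95, 30), Step(50, 30), Step(72, 90)]
FINAL = Step(72, 120)
REPEATS = 20


def programmed_seconds(initial, cycle, repeats, final):
    """Sum the programmed step times, ignoring ramping between temperatures."""
    per_cycle = sum(step.seconds for step in cycle)
    return initial.seconds + repeats * per_cycle + final.seconds


TOTAL_SECONDS = programmed_seconds(INITIAL, CYCLE, REPEATS, FINAL)
```

On these figures the programme amounts to 3150 seconds, or 52.5 minutes, of step time before the hold.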

Results 

In 293T cells the vector pRIX12, which lacks the secretion signal, produces a substantial
amount of a 60kDa non-glycosylated protein that is not secreted (Figures 29 and 30).

Example 3: Construction of vectors for expression of gp120 and Nef/Tat(mut)
from a single plasmid

Vector construction:

The gp120 Nef/Tat(m) constructs were generated by PCR stitching the gp120 and
Nef/Tat(m) or trNef/Tat(m) orfs.

5' and 3' gp120, 5' and 3' Nef/Tat(m) and 5'trNef/Tat were amplified from pRix1.
3'trNef/Tat(m) was amplified from pNTm. The following primers were used:

3'120 (antisense to): GCCAAGCGCCGCGTCGTGCAGAGA [SEQ ID NO: 16]

5'120/NT:
GCCAAGCGCCGCGTCGTGCAGAGAATGGGTGGCAAGTGGTCAAAAAGT
[SEQ ID NO: 17]

3'NT (antisense to): GGGGAGCCGACAGGCCCGAAGGAA [SEQ ID NO: 18]

5'NT/120:
GGGGAGCCGACAGGCCCGAAGGAAATGAAGGTCAAGGAGACCAGAAAG
[SEQ ID NO: 19]

5'120/trN:
GCCAAGCGCCGCGTCGTGCAGAGAATGGTGGGTTTTCCAGTCAC
[SEQ ID NO: 20]

5'trNef:
GAATTCGCGGCCGCCATGGTGGGTTTTCCAGTCACACC [SEQ ID NO: 21]

L01:
GAATTCGGATCCTCATCTCTGCACGACGCGGCGCTTGGCCCGGGTG
GGGGCCACG [SEQ ID NO: 22]

L02:
ACCACCTTGTACTTGTACAGCTCGCTCCGCCAGTTATCCCTCATGTC
GCCGCCGCCGGGC [SEQ ID NO: 23]

Fragments were amplified using PWO DNA polymerase (Roche) and the cycle:

95°C (60s); 20 x [95°C (30s), 55°C (30s), 72°C (120s)]; 72°C (120s); 4°C (hold)

Primer L01 was used as the 3' primer for 3' gp120. However there were problems using
this primer when stitching Nef/Tat or trNef/T to the 5' end of gp120 so primer L02 was
used instead.

The stitched gp120-N/Tm and gp120-trN/Tm fragments were cut with NotI and
BamHI and cloned into similarly cut p7313-ie. Due to the use of primer L02 rather than
L01 the N/Tm-gp120 and trN/Tm-gp120 fragments lacked a BamHI site, so these were
cut with NotI and AccI, and cloned into similarly cut pgp120c. All inserts were fully
verified by sequencing. The plasmids were designated pRix6 (gp120c NefTat(m)) and
pRix11 (gp120c trNefTat(m)).
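PCR stitching depends on each junction primer beginning with the end of one gene and continuing into the start of the next. The sketch below checks this for the primers of this example, written without extraction spaces; the helper names are invented for illustration and the check is a plain string comparison, not a melting-temperature or specificity analysis.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def overlap_tail(primer, junction):
    """Return the part of `primer` that follows `junction`, or None if the
    primer does not begin with the junction sequence."""
    return primer[len(junction):] if primer.startswith(junction) else None


P3_120 = "GCCAAGCGCCGCGTCGTGCAGAGA"                               # SEQ ID NO: 16
P5_120_NT = "GCCAAGCGCCGCGTCGTGCAGAGAATGGGTGGCAAGTGGTCAAAAAGT"    # SEQ ID NO: 17
P5_120_TRN = "GCCAAGCGCCGCGTCGTGCAGAGAATGGTGGGTTTTCCAGTCAC"       # SEQ ID NO: 20
L01 = "GAATTCGGATCCTCATCTCTGCACGACGCGGCGCTTGGCCCGGGTGGGGGCCACG"   # SEQ ID NO: 22

# Sequence each junction primer adds after the gp120 3' end.
NT_TAIL = overlap_tail(P5_120_NT, P3_120)
TRN_TAIL = overlap_tail(P5_120_TRN, P3_120)

# The 3' end primer L01 should anneal over the same gp120 3' end.
L01_COVERS_3_120 = revcomp(P3_120) in L01
```

Both junction primers start with the 3'120 sequence and continue with an ATG-initiated Nef or truncated Nef sequence, and L01 contains the reverse complement of 3'120 immediately after a stop codon and BamHI site.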

Example 4: Construction of vectors to investigate the effects of glycosylation and
secretion, inclusion of Tat and inclusion of Gag (p17/24) and Nef and RT on
gp120 and gp120 fusions

Vectors were constructed as shown in Figures 29 and 30 (schematic).

pRix28 and pRix29 (Figures 7 and 8)

pRix28 and 29 containing ds gp120c NefTat(m) and ds gp120c trNefTat were generated
by transferring the AccI-BamHI fragments from pRix6 (2315bp) and pRix11 (2123bp)
into similarly cut pRix12 (ds gp120c).

pRix30 and pRix31 (Figure 9)

To generate glycosylated and non-glycosylated fusion vectors of gp120c Nef without
Tat, the NotI-KpnI fragment was transferred from pRix11 (1580bp) or pRix29
(1496bp) into similarly cut pRix15, a vector containing Tat/trNef.

(pRix15) - Tat(mut)trNef

The genes for Tat and trNef were PCR amplified from pNTm using the following
primers:

5'Tat: 5' GAATTCGCGGCCGCCATGGAGCCAGTAGATCCTAGAC [SEQ ID NO: 24]
3'Tat: 5' TTCCTTCGGGCCTGTCGGC [SEQ ID NO: 25]
5'trTN: GCCGACAGGCCCGAAGGAAATGGTGGGTTTTCCAGTCACAC [SEQ ID NO: 26]
3'Nef: GAATTCGGATCCTTAGCAGTTCTTGAAGTACTCCGG [SEQ ID NO: 27]

The individual genes were gel purified and then PCR stitched to give TmtrN using the
end primers. The fusion was then digested with NotI and BamHI and cloned into
p7313-ie.

pRix32

To generate the fusion containing p17/24, gp120 was PCR amplified from pgp120c
using primers U1 and 3'120, p17/24-Nef was amplified from p73i-GN2 using primers
5'120G and 3'Nef, and the two were PCR stitched using U1 and 3'Nef. p73i-GN2
contained a synthetic codon optimised sequence of p17/p24 based on the sequence of
HXB2 (GenBank entry K03455) and designed using SynGene and assembled from
overlapping oligonucleotides as described for codon optimised gp120 above, fused to
HXB2 Nef, which had been obtained from plasmid pHXBAPr (B. Maschera, E. Furfine
and E.D. Blair 1995 J. Virol. 69: 5431-5436) by PCR. Since the HXB2 nef gene in this
plasmid contains a premature termination codon, two overlapping PCRs were used to
repair the codon (TGA [stop] to TGG [Trp]). The position of the repaired codon is
underlined in the sequence. The p17/p24/Nef gene was inserted into the NotI and
BamHI sites of plasmid p7313-ie. The coding sequence and map is given in Figure 10.
A * marks the p24/trNef junction.



Primers:

U1:
GAATTCGCGGCCGCAATGAAGGTCAAGGAGACCAGAAAGAACTACCAGC
ATCTGTG [SEQ ID NO: 28]

3'120: TCTCTGCACGACGCGGCGCTTGGC [SEQ ID NO: 29]

5'120G:
GCCAAGCGCCGCGTCGTGGAGAGAATGGGTGCCCGAGCTTCGGTAC [SEQ ID NO: 30]

3'Nef: GAATTCGGATCCTTAGCAGTTCTTGAAGTACTCCGG [SEQ ID NO: 31]

Initial cycle:

94°C (30s) 20x[94°C (30s) 50°C (30s) 68°C (180s)] 68°C (120s) 4°C (0s)
Using pfx polymerase with 1x enhancer.

Stitch:

94°C (30s) 20x[94°C (30s) 50°C (30s) 72°C (180s)] 72°C (120s) 4°C (0s)
Using Vent polymerase in standard conditions + 2mM MgCl2.

The product was cut with NotI and BamHI and cloned into p7313-ie.

On sequencing, the construct was found to have an error in the signal peptide, which
was corrected by transferring the 2560bp BstEII - KpnI fragment containing the back
of gp120 to the front of Nef into pRix30.

pRix33 (Figure 11), pRix34, and pRix35 (Figure 12)

The 2560bp BstEII - KpnI fragment containing the back of gp120 to the front of Nef
was transferred to pRix31, pRix11 and pRix29 to make vectors pRix33 (Figure 11),
34 and 35 (Figure 12) respectively.

pRix39 (gp120 codon optimised, minus secretion signal - p17/24 gag - Nef-Tat -
Figure 13) and pRix40-47 (Constructs pRix40-47 contain non-glycosylated gp120,
gag-p17/24, Nef and Tat(m) fusions with mutations in the myristoylation site and/or
dileucine motif of Nef)

A fragment containing gag p17/24 was PCR amplified from vector pRix35 using
primers:

5'120G:
GCCAAGCGCCGCGTCGTGGAGAGAATGGGTGCCCGAGCTTCGGTAC [SEQ ID NO: 32]

and

p24AS: CAACACTCTGGCTTTGTGTCC [SEQ ID NO: 33]

Full length Nef was PCR amplified from pNTm using primers:

5'p24-N: GGACACAAAGCCAGAGTGTTGATGGGCAAGTGGTCAAAAAGTAG
[SEQ ID NO: 34]

and

3'Nef: GAATTCGGATCCTTAGCAGTTCTTGAAGTACTCCGG [SEQ ID NO: 35]

The two fragments were PCR stitched together using the end primers (5'120G and
3'Nef). The product was cut with SalI and KpnI, and the 423bp fragment containing
part of p24 and Nef was used to replace the corresponding fragment in pRix35 to
make pRix39 (Figure 13).

pRix40 (Figure 14) was similarly constructed except primer 5'p24-N was replaced
with primer:

5'p24-Ndm:
GGACACAAAGCCAGAGTGTTGATGGGCAAGTGGTCAAAAAGTAG [SEQ ID NO: 36]
GHKARVL|MGKWSKS

This primer deletes one G to destroy the myristoylation site at the start of Nef.

pRix41-44, 46 and 47 (Figures 15 to 20)

Mutations to the dileucine motif in Nef (L174L175) were made by PCR:

To insert the mutations, the portion of Nef 5' to the LL motif was PCR amplified
using the 5'Nef primer and asNefLL


5'Nef GAATTCGCGGCCGCCATGGGTGGCAAGTGGTCAAAAAG [SEQ ID NO: 37]

asNefLL (antisense to)
GCCAATAAAGGAGAGAACACCAGC [SEQ ID NO: 38]
ANKGENTS

Mutations to L174, L175 or both 174 and 175 were generated using forward primers

sNefL1 (L174A)
GCCAATAAAGGAGAGAACACCAGCGCCTTACACCCTGTGAGCCTGCATG
ANKGENTS|ALHPVSLH
[SEQ ID NO: 39]

sNefL2 (L175A)
GCCAATAAAGGAGAGAACACCAGCTTGGCACACCCTGTGAGCCTGCATG
ANKGENTS|LAHPVSLH
[SEQ ID NO: 40]

sNefLL (LL174/5AA)
GCCAATAAAGGAGAGAACACCAGCGCCGCACACCCTGTGAGCCTGCATG
ANKGENTS|AAHPVSLH
[SEQ ID NO: 41]

and the 3'NT primer:

3'NT (antisense to): GGGGAGCCGACAGGCCCGAAGGAA [SEQ ID NO: 42]

to amplify the 3' portion of Nef. The 5' and each of the 3' products were PCR stitched
using the 5'Nef and 3'NT primers. These were cut with KpnI and SpeI and inserted
into similarly cut pRix39 to generate pRix41 (L174A), pRix42 (L175A), and pRix43
(LL174/175AA) in the absence of the myristoylation site mutation, or into pRix40 to
generate pRix44 (mLL174/175AA), pRix46 (mL174A) and pRix47 (mL175A) with
the myristoylation site mutation.
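The amino acid annotations given under the mutagenic primers can be reproduced by translating each primer in frame. The sketch below builds the standard genetic code table and translates the three forward primers (written without extraction spaces); the function and variable names are assumptions made for this illustration, and any trailing partial codon is ignored.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq):
    """Translate a DNA sequence in frame 1, dropping any incomplete codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


WILD_TYPE_5PRIME = "GCCAATAAAGGAGAGAACACCAGC"  # sense sequence of asNefLL
S_NEF_L1 = "GCCAATAAAGGAGAGAACACCAGCGCCTTACACCCTGTGAGCCTGCATG"  # L174A
S_NEF_L2 = "GCCAATAAAGGAGAGAACACCAGCTTGGCACACCCTGTGAGCCTGCATG"  # L175A
S_NEF_LL = "GCCAATAAAGGAGAGAACACCAGCGCCGCACACCCTGTGAGCCTGCATG"  # LL174/5AA

TRANSLATIONS = {name: translate(seq) for name, seq in {
    "sNefL1": S_NEF_L1, "sNefL2": S_NEF_L2, "sNefLL": S_NEF_LL}.items()}
```

The translations show alanine replacing the first, the second, or both residues that follow the shared ANKGENTS stretch, in agreement with the annotations listed with the primers.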



pRix50, 51, 52, 53, 54 and 60 (Figures 23 to 27)

To generate dual promoter vectors containing non-glycosylated gp120, initially pRix12
was cut with Sse8387I, blunt ended with T4 DNA polymerase to remove the
overhangs, and cut with ClaI. The 4573bp product was gel purified (Fragment A).

pRix50: the plasmid pNTm was cut with ClaI and XmnI and the 1897bp fragment was
gel purified and ligated with Fragment A.

pRix51: the plasmid p73i-GN2 was cut with ClaI and XmnI and the 2533bp fragment
was gel purified and ligated with Fragment A.

pRix52: Plasmids p73i-GN2 and pNTm were cut with BglII and EcoRI, and the
4090bp and 1060bp fragments were isolated from the respective digests. These
fragments were ligated to produce the vector pRix52.

pRix53: the plasmid pRix52 was cut with ClaI and XmnI and ApaLI and the 2794bp
fragment was gel purified and ligated with Fragment A.

pRix54: the plasmid pT-rng (described in PG5035) was cut with ClaI and XmnI and
the 4219bp fragment was gel purified and ligated with Fragment A.

pRix60: pT-rng was cut with Sse8387I, blunt ended with T4 DNA polymerase to
remove the overhangs, and cut with ClaI. The large fragment was gel purified. The
plasmid pRix12 was cut with ClaI and XmnI and DraI and the 2476bp fragment was
gel purified and ligated with the fragment from pT-rng.
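As a rough cross-check on the fragment sizes quoted above, the approximate size of each ligated vector can be estimated by adding the insert to its backbone. This is a simplified sketch: it treats the quoted fragment lengths as additive, ignores any bases gained or lost at overhangs, and the variable names are chosen for illustration only.

```python
FRAGMENT_A_BP = 4573  # Sse8387I (blunted) - ClaI backbone from pRix12

# Inserts ligated with Fragment A, sizes as quoted in the text.
INSERTS_BP = {"pRix50": 1897, "pRix51": 2533, "pRix53": 2794, "pRix54": 4219}

# Approximate vector size = backbone + insert.
EXPECTED_BP = {name: FRAGMENT_A_BP + size for name, size in INSERTS_BP.items()}

# pRix52 is built from two fragments of p73i-GN2 and pNTm.
PRIX52_BP = 4090 + 1060
```

On this estimate pRix54 would be about 8.8kb, which is at least large enough to yield the 7113bp MunI-BstEII fragment used later for pRix59.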

pRix58 (Figure 21)

The gp120 fragment without signal sequence was PCR amplified from pgp120c using
the primers:

5'ds120: GAATTCGCGGCCGCCATGGCCGAGCAGCTGTGGGTCACC [SEQ ID NO: 43]

3'120 (antisense to): GCCAAGCGCCGCGTCGTGCAGAGA [SEQ ID NO: 44]

The 5' end of RT (codon optimised and containing the W229K inactivating mutation)
was PCR amplified from pT-rng (Figure 28 - see also WO 03/025003) using a 5'
primer to insert a sequence homologous to 3'120, and a primer within RT:

120RTf: GCCAAGCGCCGCGTCGTGCAGAGAATGGGCCCCATCAGTCCCATC
[SEQ ID NO: 45]

RT3SR1: CGTCACGATGTTCACCTCCAGGCC [SEQ ID NO: 46]

The two products were PCR stitched using the end primers, and cut with NotI and
NheI. The fragment was gel purified and used to replace the NotI-NheI fragment from
pT-rng.

pRix59 (Figure 22): the 3' Gag fragment was PCR amplified from pT-rng using a
primer 5' to the MunI site in p24 and a 3' primer encoding the start of dsgp120,
covering the position of the BstEII site near the 5' end:

GagMunf:
GTGGCCCGAGAGCTGCATCCG [SEQ ID NO: 47]

GAG120R (antisense to), Gag followed by ds120:
GGACACAAAGCCAGAGTGTTGATGGCCGAGCAGCTGTGGGTCACCGTC
[SEQ ID NO: 48]

The product was cut with MunI and BstEII, and inserted into the 7113bp fragment
from MunI-BstEII cut pRix54.

Expression data is shown in Figures 29, 30 and 31.

293T cell monolayers in 24 well plates were transfected with 1 µg of each DNA
indicated using Lipofectamine 2000 following the manufacturer's supplied protocol.
After 24 hours the cells were detached and separated from the culture medium by
centrifugation. Samples equivalent to 1x10^4 cells or 12 µl of medium were examined
by PAGE and Western blot.

The gp120c construct gave a highly glycosylated, well secreted protein. Addition of
C-terminal Nef/Tat fusions (pRix6 and pRix11) resulted in a reduction of the
intracellular protein levels and loss of secretion. Removal of the secretion signal from
gp120c (pRix12) gave a non-glycosylated non-secreted form of the protein.

As expected, fusion constructs with no secretion signal, pRix28, 29, 31, 33 and 35,
made non-glycosylated intracellular proteins in similar amounts, though expression
from pRix35 was somewhat reduced. Surprisingly, when the secretion signal was
present in constructs pRix30, 32 and 34, only pRix34 failed to be secreted. It appears
that the presence of Tat in the fusion inhibits secretion of the protein. The initial
pRix32.1 construct had a point mutation resulting in poor expression. This was
corrected in pRix32.7, which showed greatly improved expression.

For pRix40-47 the western blot in Figure 31 shows the expression of the
dsgp120/Gag/Nef/Tat fusions with mutations in Nef in 293T cells 24 hours post
transfection with the plasmids indicated. Total cell extracts equivalent to ~1x10^4 cells
were loaded onto the gel. The blot was probed with an anti-nef antiserum.

For pRix50, 51, 53, 54, 58, 59 and 60 the two sets of western blots in Figure 31 show
the expression of the products of the dual promoter vectors indicated. Total cell
extracts equivalent to ~1x10^4 293T cells, 24 hours post transfection, were loaded onto
the gel. The left blot was probed with an anti-nef antiserum, and the right blot was
probed with anti-gp120 antiserum.

Example 5: Preparation of plasmid-coated 'gold slurry 9 for 'gene gun' DNA 
cartridges 

Plasmid DNA (approximately 1 µg/µl), e.g. 100 µg, and 2 µm gold particles, e.g. 50 mg
(PowderJect), were suspended in 0.05 M spermidine, e.g. 100 µl (Sigma). The DNA
was precipitated on to the gold particles by addition of 1 M CaCl2, e.g. 100 µl
(American Pharmaceutical Partners, Inc., USA). The DNA/gold complex was
incubated for 10 minutes at room temperature and washed 3 times in absolute ethanol,
e.g. 3 x 1 ml (previously dried on molecular sieve 3A (BDH)). Samples were resuspended
in absolute ethanol containing 0.05 mg/ml of polyvinylpyrrolidone (PVP, Sigma), and
split into three equal aliquots in 1.5 ml microfuge tubes (Eppendorf). The aliquots
were for analysis of (a) 'gold slurry', (b) eluate: plasmid eluted from (a), and (c) for
preparation of gold/plasmid coated Tefzel cartridges for the 'gene gun' (see Example
3 below). For preparation of samples (a) and (b), the tubes containing plasmid DNA /
'gold slurry' in ethanol / PVP were spun for 2 minutes at top speed in an Eppendorf
5418 microfuge, the supernatant was removed and the 'gold slurry' dried for 10
minutes at room temperature. Sample (a) was resuspended to 0.5 - 1.0 µg/µl of
plasmid DNA in TE pH 8.0, assuming approx. 50 % coating. For elution, sample (b)
was resuspended to 0.5 - 1.0 µg/µl of plasmid DNA in TE pH 8.0 and incubated at
37°C for 30 minutes, shaking vigorously, and then spun for 2 minutes at top speed in
an Eppendorf 5418 microfuge; the supernatant (eluate) was removed and stored at
-20°C. The exact DNA concentration eluted was determined by spectrophotometric
quantitation using a Genequant II (Pharmacia Biotech).
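
The quantities in this Example can be checked with a short calculation. The sketch below is illustrative only: it uses the example amounts given above (100 µg plasmid, 50 mg gold, three equal aliquots, approx. 50 % coating, target 0.5 - 1.0 µg/µl), and the function and parameter names are hypothetical, not part of the described method.

```python
def resuspension_volumes_ul(total_dna_ug=100.0, n_aliquots=3,
                            coating_fraction=0.5,
                            target_conc_ug_per_ul=(0.5, 1.0)):
    """Return (min, max) TE volume in ul for one aliquot of gold slurry.

    Assumes the DNA is split evenly between aliquots and that only
    `coating_fraction` of it is actually bound to the gold.
    """
    dna_per_aliquot_ug = total_dna_ug / n_aliquots
    coated_dna_ug = dna_per_aliquot_ug * coating_fraction
    low_conc, high_conc = target_conc_ug_per_ul
    # The higher target concentration needs the smaller volume.
    return coated_dna_ug / high_conc, coated_dna_ug / low_conc


def nominal_loading_ug_per_mg(total_dna_ug=100.0, gold_mg=50.0):
    """Nominal DNA input per mg of gold, before any coating losses."""
    return total_dna_ug / gold_mg


min_ul, max_ul = resuspension_volumes_ul()
loading = nominal_loading_ug_per_mg()
print(f"TE volume per aliquot: {min_ul:.1f} - {max_ul:.1f} ul")
print(f"Nominal input: {loading:.1f} ug DNA per mg gold")
```

With the example amounts, each aliquot carries about 33 µg of input DNA, of which roughly half is assumed to be coated, so about 17 to 33 µl of TE would give the stated concentration range.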

Example 6: Preparation of Cartridges for DNA immunisation 

Preparation of cartridges for the Accell gene transfer device was as previously
described (Eisenbraun et al, DNA and Cell Biology, 1993, Vol 12, No 9, pp 791-797;
Pertner et al). Briefly, plasmid DNA was coated onto 2 µm gold particles (DeGussa
Corp., South Plainfield, N.J., USA) and loaded into Tefzel tubing, which was
subsequently cut into 1.27 cm lengths to serve as cartridges and stored desiccated at
4°C until use. In a typical vaccination, each cartridge contained 0.5 mg gold coated
with a total of 0.5 µg DNA.



Example 7: PMID immunisations including using gp120-Nef-Tat triple fusion
lacking gp120 secretion signal

Protocol: For PMID immunisations, DNA cartridges were prepared using
standard methods as described in Examples 5 and 6. A DNA loading rate of 2, which
gives approximately 0.5 µg DNA/cartridge, was used and each immunisation
consisted of two shots. Balb/c mice were given a primary immunisation of DNA
(using PMID). The mice were boosted 28 days later with DNA (using PMID). Mice
were culled 7 days later and serum and spleens were collected. The splenocytes were
harvested by teasing out the spleen cells, and erythrocytes were lysed. The splenocytes
were washed and counted. Specialised ELIspot plates (coated with interferon-gamma
capture antibody and blocked) were used. Splenocytes were transferred to these plates
and incubated overnight at 37°C/5% CO2 in the presence of a gp120 peptide, RT
peptide or Gag peptide. The splenocytes were lysed and the plate developed using
standard procedures to demonstrate the number of interferon-gamma secreting cells
present. Serum was analysed by ELISA to detect specific antibodies. Results
are shown in Figures 32-34.
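
The dose arithmetic implied by this protocol can be sketched as follows. This is an interpretation rather than something the text states: it assumes that "a DNA loading rate of 2" means 2 µg DNA per mg gold, that a cartridge holds 0.5 mg gold (Example 6), and that about 50 % of the DNA is coated (Example 5), which together reproduce the stated figure of approximately 0.5 µg DNA per cartridge. All names are hypothetical.

```python
def dna_per_cartridge_ug(loading_ug_per_mg, gold_mg_per_cartridge,
                         coating_fraction):
    """Estimated DNA actually carried by one cartridge."""
    return loading_ug_per_mg * gold_mg_per_cartridge * coating_fraction


def dna_per_immunisation_ug(per_cartridge_ug, shots=2):
    """Each immunisation consists of `shots` cartridges."""
    return per_cartridge_ug * shots


# Assumed values: 2 ug/mg loading, 0.5 mg gold per cartridge, 50 % coating.
per_cartridge = dna_per_cartridge_ug(2.0, 0.5, 0.5)
per_immunisation = dna_per_immunisation_ug(per_cartridge, shots=2)
prime_plus_boost = 2 * per_immunisation

print(per_cartridge, per_immunisation, prime_plus_boost)
```

Under these assumptions a mouse receives about 1 µg of DNA per immunisation and about 2 µg over the prime and boost.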



Conclusion

Unexpectedly, the cellular immune response of mice immunised with dsgp120 (gp120
lacking secretion signal) expressing constructs was approximately double that of mice
immunised with gp120 constructs (see Figures 32 and 33). This was consistent with
the observation that in in vitro transfection studies the expressed dsgp120 had
remained largely cell associated, whereas gp120 had been secreted.

Inclusion of Tat (mutated Tat) in the dsgp120 constructs increased the cellular
immune response to twice that of the dsgp120 constructs without Tat (Figures 32 and
33). Tat on its own did not affect the immune response to gp120, but acted
synergistically with dsgp120 to optimise the cellular response.

The inclusion of other HIV antigens in the constructs or dual promoter vectors
produced a balanced cellular response to all the different antigens included and thus
broadened the immune response compared to the gp120 only vectors (Figure 34).

CLAIMS

1. A polynucleotide which comprises a sequence encoding an HIV envelope
protein or fragment or immunogenic derivative thereof, which is substantially
non-glycosylated when expressed in a mammalian target cell, operably linked to a
heterologous promoter.

2. The polynucleotide according to claim 1 wherein the HIV envelope protein or
fragment or immunogenic derivative thereof is gp120 or a fragment or immunogenic
derivative thereof.

3. The polynucleotide according to claim 1 or claim 2 wherein the envelope
protein lacks a functional secretion signal.

4. The polynucleotide according to claim 2 or claim 3 wherein the gp120 is
expressed as a fusion protein comprising at least one other HIV protein or fragment or
immunogenic derivative thereof.

5. The polynucleotide according to claim 4 wherein the at least one other HIV
protein or fragment or immunogenic derivative is selected from Nef, Gag, RT or Tat.

6. The polynucleotide according to claim 5 wherein the gp120 encoding sequence
is linked to a sequence encoding HIV RT or a fragment or immunogenic derivative
thereof and a sequence encoding HIV Gag or a fragment or immunogenic derivative
thereof and a sequence encoding HIV Nef or a fragment or immunogenic derivative
thereof to encode a gp120, RT, Gag and Nef-containing fusion protein.

7. The polynucleotide according to claim 6 wherein the fusion is selected from
gp120-RT-Nef-Gag and RT-Nef-Gag-gp120.



WO 2004/041852 PCT/EP2003/012402 

8. The polynucleotide according to claim 5 wherein the gp120 sequence is linked
to a sequence encoding HIV Tat or a fragment or immunogenic derivative thereof and
a sequence encoding HIV Nef or a fragment or immunogenic derivative thereof to
encode a gp120, Nef and Tat-containing fusion protein.

9. The polynucleotide according to claim 8 wherein the fusion is a
gp120-Nef-Tat fusion.

10. The polynucleotide according to claim 8 wherein the gp120 is further linked to
a sequence encoding HIV Gag or a fragment or immunogenic derivative thereof to
encode a gp120, Nef, Tat and Gag-containing fusion protein.

11. The polynucleotide according to claim 10 wherein the fusion is a
gp120-Gag-Nef-Tat fusion.

12. The polynucleotide according to any one of claims 5, 6, 7, 10 or 11 wherein
the Gag comprises p17 and/or p24.

13. The polynucleotide according to any one of claims 1 to 12 wherein one or
more of the sequences encoding gp120, Nef, Gag, RT or Tat or fragment or
immunogenic derivative thereof is or are codon optimised to resemble the codon
usage in a highly expressed human gene.



14. A polynucleotide sequence selected from the group:

1. gp120 codon optimised, minus secretion signal
2. gp120 codon optimised, minus secretion signal - tr Nef
3. gp120 codon optimised, minus secretion signal - tr Nef - mTat
4. gp120 codon optimised, minus secretion signal - Nef - mTat
5. gp120 codon optimised, minus secretion signal - p17/24 Gag - tr Nef
6. gp120 codon optimised, minus secretion signal - p17/24 Gag - tr Nef - mTat
7. gp120 codon optimised, minus secretion signal - p17/24 Gag - Nef-mTat
8. gp120 codon optimised, minus secretion signal - p17/24 Gag - mNef-mTat
9. gp120 codon optimised, minus secretion signal - p17/24 Gag - L1Nef-mTat
10. gp120 codon optimised, minus secretion signal - p17/24 Gag - L2Nef-mTat
11. gp120 codon optimised, minus secretion signal - p17/24 Gag - LLNef-mTat
12. gp120 codon optimised, minus secretion signal - p17/24 Gag - mLLNef-mTat
13. gp120 codon optimised, minus secretion signal - p17/24 Gag - mL1Nef-mTat
14. gp120 codon optimised, minus secretion signal - p17/24 Gag - mL2Nef-mTat
15. gp120 codon optimised, minus secretion signal - mRT - trNef - p17/24 Gag
16. mRT - trNef - p17/24 Gag - gp120 codon optimised, minus secretion signal
wherein the RT and Gag are codon optimised.



15. The polynucleotide according to any one of claims 1 to 14 wherein the 
promoter is the promoter from HCMV IE gene. 



16. The polynucleotide according to claim 15 wherein the 5' untranslated region
between the promoter and coding polynucleotide comprises exon 1.

17. A set of polynucleotides comprising a polynucleotide according to any of 
claims 1 to 16 and at least one further polynucleotide encoding at least one of HIV 
Nef, Gag, RT or Tat or fragment or immunogenic derivative thereof. 

18. The set of polynucleotides according to claim 17 wherein the polynucleotides 
are contained on a single vector under the control of two or more separate promoters. 

19. The set of polynucleotides according to claim 17 or 18, encoding a gp120 or
fragment or immunogenic derivative and a fusion of RT-Nef-Gag or of fragments or
immunogenic derivatives thereof.

20. A set of polynucleotides according to claim 17 or claim 18 selected from the
following:

1. gp120 codon optimised, minus secretion signal + tr Nef-mTat
2. gp120 codon optimised, minus secretion signal + P17/24 Gag - tr Nef
3. gp120 codon optimised, minus secretion signal + P17/24 Gag - Nef - mTat
4. mRT - tr Nef - P17/24 Gag + gp120 codon optimised, minus secretion signal
5. gp120 codon optimised, minus secretion signal + mRT - trNef - P17/24 Gag
wherein RT and Gag are codon optimised.



21. A vector comprising a polynucleotide or polynucleotides as claimed in any one
of claims 1 to 20.

22. The vector according to claim 21 which is a double stranded DNA plasmid.

23. The vector according to claim 21 which is a replication defective adenovirus
vector.

24. The vector according to claim 23 which is derived from Pan 9, 5, 6 or 7.

25. A fusion protein comprising a substantially non-glycosylated HIV envelope
protein or a fragment or immunogenic derivative thereof and at least one additional
HIV protein or fragment or immunogenic derivative thereof, said additional HIV
protein selected from Nef, Gag, RT and Tat.

26. A composition comprising a substantially non-glycosylated HIV envelope
protein or a fragment or immunogenic derivative thereof and at least one additional
HIV protein or fragment or immunogenic derivative thereof, preferably in the form of
a fusion protein, said at least one additional HIV protein selected from Nef, Gag, RT
and Tat.

27. A polypeptide encoded by the polynucleotide or vector according to any of
claims 1 to 24.

28. A pharmaceutical composition comprising a nucleotide sequence or a set of
nucleotide sequences according to any one of claims 1 to 20, a vector of claim 21 to
24, a fusion protein according to claim 25 or a composition according to claim 26 or a
polypeptide according to claim 27, and a pharmaceutically acceptable excipient,
diluent, carrier or adjuvant.

29. The pharmaceutical composition according to claim 28 wherein the carrier is a
plurality of particles such as gold beads.

30. The pharmaceutical composition according to claim 28 or 29 for delivery in a
prime boost format.

31. An intradermal delivery device comprising a pharmaceutical composition
according to any one of claims 28 to 30.

32. A method of treating a patient suffering from or susceptible to a disease
comprising administering a safe and effective amount of a pharmaceutical
composition according to any one of claims 28 to 30.

33. A polynucleotide or a vector or fusion protein or composition or polypeptide
according to any one of claims 1 to 27 for use in medicine.

34. Use of a polynucleotide or a vector or fusion protein or composition or
polypeptide according to any one of claims 1 to 27 in the manufacture of a
medicament for the treatment of disease.

35. A process for the production of a polynucleotide according to any one of claims
1 to 20 comprising linking a nucleotide sequence encoding a substantially
non-glycosylated HIV envelope molecule or fragment or immunogenic derivative thereof
and optionally a sequence encoding an HIV regulatory protein or fragment or
immunogenic derivative thereof, to a heterologous promoter sequence.

Figure 1

Figure 2



Map of pgp120c:




The amino acid sequence of the W61D gp120 is below. The signal sequence is underlined and bold, up to
the predicted cleavage site between amino acids 29 and 30. This is the sequence removed in dsgp120
(pRix12 etc).

MKVKETRKNYQHLWRWGTMLLGMLMICSAAEQLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATH
ACVPTDPNPQEVVLGNVTEYFNMWKNNMVDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTT
SNGWTGEIRKGEIKNCSFNITTSIRDKVQKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQA
CPKVSFEPIPIHYCAPAGFAILKCNNKTFDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSD
NFMDNTKTIIVQLNESVAINCTRPNNNTRKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVI
KLREHFGNKTIKFNQSSGGDPEIVRHSFNCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQII
NMWQEVGKAMYAPPIGGQIRCSSNITGLLLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKV
EPLGVAPTRAKRRVVQR [SEQ ID NO: 49]

The codon optimised DNA sequence for the W61D gpl20 gene is: 

ATGAAGGTCAAGGAGACCAGAAAGAACTACCAGCATCTGTGGCGCTGGGGCACCATGCTCCTGGGAATGCT 
GATGATCTGCTCCGCCGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCA 
CGACCACCCTCTTCTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCAT
GCTTGCGTGCCTACGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTG 
GAAGAATAACATGGTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCG 
TGAAGCTGACGCCTCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACC 
AGCAACGGCTGGACCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGAT 
CAGAGACAAGGTGCAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATG
CCACCACCAAGAACAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCC 
TGCCCCAAGGTGTCCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTG 
TAACAACAAGACCTTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCC 
Figure 2 continued



GCCCCGTCGTGAGCACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGAC 
AACTTCATGGACAACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCC
TAACAACAACACCCGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCG 
GCGACATCCGGCAGGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATC 

AAGCTGAGAGAGCACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGT
GCGGCACTCCTTCAACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGA 

ACGGCACCGAGGGCAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATC
AACATGTGGCAGGAGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAA 
CATCACCGGCCTGCTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCT 
TCAGGCCCGGCGGCGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTG 
GAGCCGCTCGGCGTGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGATGA [SEQ ID NO: 50]
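
The removal of the signal sequence described for this Figure can be illustrated on the first 35 codons of the codon optimised sequence. The sketch below is illustrative only: it translates with the standard genetic code, drops the 29 codons of the predicted signal sequence and adds a start codon, which reproduces the start of the pRix12 insert shown in Figure 6. The function names are hypothetical, and the text does not state that the constructs were designed this way.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate complete codons of `dna`, reading from position 0."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def remove_signal_sequence(dna, signal_length_aa=29):
    """Drop the first `signal_length_aa` codons and prepend a start codon."""
    return "ATG" + dna[3 * signal_length_aa:]


# First 105 nucleotides (35 codons) of the codon optimised W61D gp120 gene.
GP120_START = (
    "ATGAAGGTCAAGGAGACCAGAAAGAACTACCAGCATCTGTGGCGCTGGGGCACCATGCTCCTGGGAATGCT"
    "GATGATCTGCTCCGCCGCCGAGCAGCTGTGGGTC"
)

full_protein_start = translate(GP120_START)
ds_dna_start = remove_signal_sequence(GP120_START)
ds_protein_start = translate(ds_dna_start)

print(full_protein_start)
print(ds_dna_start, ds_protein_start)
```

The resulting nucleotides, ATGGCCGAGCAGCTGTGGGTC, match the first 21 nucleotides of the pRix12 insert.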
Figure 3

Map of pRixl5244: 
Figure 4

Plasmid pNTm: 




NotI



Sequence of insert: 

ATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGA
GCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACAAGTAGCAATA 
CAGCAGCTACCAATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACA 
CCTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGG 
GGGACTGGAAGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAG 
GCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGC 
TACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACA 
CCCTGTGAGCCTGCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAG 
CATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCT 
AGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTG 
TTGCTTTCATTGCCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGAC 
AGCGACGAAGACCTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCC 
AAAGGGGAGCCGACAGGCCCGAAGGAATAA [SEQ ID NO: 51] 

Amino acid sequence of antigen: 

MGGKWSKSSVVGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQEE
EEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYT
PGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFH
HVARELHPEYFKNCTSEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFHCQVCFITAALGISYGRK
KRRQRRRPPQGSQTHQVSLSKQPTSQSKGEPTGPKE [SEQ ID NO: 52]

Figure 5

Plasmid ptrNTm: 




NotI



Sequence of insert: 

ATGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAG 
CCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATC 
TGTGGATCTACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATAT 
CCACTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGG 
AGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGT 
GGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGC 
ACTAGTGAGCCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTAC 
CAATTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCT 
ATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCA
AAGCAACCCACCTCCCAATCCAAAGGGGAGCCGACAGGCCCGAAGGAATAA [SEQ ID NO: 53]



Amino acid sequence of antigen: 

MVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRY 

PLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 

TSEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFHCQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLS 
KQPTSQSKGEPTGPKE [SEQ ID NO: 54] 
Figure 6

Plasmid pRixl2: 




Sequence of insert: 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT
CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA
CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG
GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC
TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA
CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG
CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA
CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT
CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC
TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG
CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA
ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC
CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA
GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC
ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC
AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG
CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG
AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG
CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG
CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG
TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGATGA [SEQ ID NO: 55]

Amino acid sequence of antigen: 
MAEQLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLGNVTEYFNMWKNNM
VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT
FDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFMDNTKTIIVQLNESVAINCTRPNNNT
RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQR [SEQ ID NO: 56]



Figure 7 

Plasmid pRix28:



BamHI 




Sequence of insert: 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT

CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA 

CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG 

GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 

TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 

CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 

CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA 

CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 

CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 

TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG 

CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA 

ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC 

CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 

GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC 

ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 

AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 

CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG 

AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 

CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 

CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 

TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTT 

GGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCG

AGACCTGGAAAAACATGGAGCAATCACAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGGCTAG 

AAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAG

GCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAACGAAG

ACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCAG 

GGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTA 

GAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGATGACCCTGA 

GAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGG 

AGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAG 
Figure 7 continued



CCTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACAGC
TGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGTCAGACTC
ATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCAAAGGGGAGCCGACAGGCCCGAAGGAATAA

[SEQ ID NO: 57] 
Amino acid sequence of antigen: 
MAEQLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLGNVTEYFNMWKNNM
VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT
FDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFMDNTKTIIVQLNESVAINCTRPNNNT
RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMGGKWSKSSVV
GWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQEEEEVGFPVTPQVPLRPMTYK
AAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKV
EEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEPWKHPGSQ
PKTACTNCYCKKCCFHCQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGEPTGPKE

[SEQ ID NO: 58]
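
Comparing the inserts shows that the pRix28 insert reads as the pRix12 insert without its stop codon, followed directly by the pNTm insert. The sketch below illustrates such an in-frame joining on the last 18 nucleotides of the pRix12 insert and the first 18 nucleotides of the pNTm insert. It is illustrative only: the function name is hypothetical, and the text does not describe the cloning steps actually used.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(upstream_orf, downstream_orf):
    """Join two reading frames, dropping the upstream stop codon if present."""
    if len(upstream_orf) % 3 or len(downstream_orf) % 3:
        raise ValueError("each ORF length must be a multiple of three")
    if upstream_orf[-3:] in STOP_CODONS:
        upstream_orf = upstream_orf[:-3]
    return upstream_orf + downstream_orf


PRIX12_END = "CGCGTCGTGCAGAGATGA"   # last 6 codons of the pRix12 insert (R V V Q R stop)
PNTM_START = "ATGGGTGGCAAGTGGTCA"   # first 6 codons of the pNTm insert (M G G K W S)

junction = fuse_in_frame(PRIX12_END, PNTM_START)
print(junction)
```

The joined fragment, CGCGTCGTGCAGAGAATGGGTGGCAAGTGGTCA, is found at the gp120/Nef junction of the pRix28 insert, consistent with the ...RRVVQRMGGKWS... junction in the amino acid sequence above.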
Figure 8

Plasmid pRix29:



BamHI




Sequence of insert: 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT 

CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA 

CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG 

GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 

TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 

CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 

CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA 

CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 

CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 

TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG 

CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA 

ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC 

CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 

GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC 

ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 

AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 

CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG 

AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 

CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 

CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 

TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGTGGGTTTTCCAGTCACACCTCAGGTACCT 

TTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGG 

GCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTG 

ATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTA 

CCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCT 

GCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACG 

TGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTAGACTAGAGCCC 

TGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTG 

CCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGAC 

CTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCAAAGGGGAGCCG 

ACAGGCCCGAAGGAATAA [SEQ ID NO: 59]

Figure 8 continued



Amino acid sequence of antigen: 
MAEQLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLGNVTEYFNMWKNNM
VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT
FDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFMDNTKTIIVQLNESVAINCTRPNNNT
RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMVGFPVTPQVP
LRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLV
PVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEP
WKHPGSQPKTACTNCYCKKCCFHCQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGEP
TGPKE [SEQ ID NO: 60]

Figure 9

Plasmid pRix31: 



BamHI 




Sequence of insert: 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT

CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA 

CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG 

GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 

TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 

CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 

CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA 

CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 

CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 

TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG 

CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA 

ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC

CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 

GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC 

ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 

AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 

CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG

AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 

CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 

CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 

TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGTGGGTTTTCCAGTCACACCTCAGGTACCT 

TTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGG 

GCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTG 

ATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTA

CCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCT 

GCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACG

TGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCTAA [SEQ ID NO: 61]



Amino acid sequence of antigen:

Figure 9 continued



MAEQLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLGNVTEYFNMWKNNM
VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT
FDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFMDNTKTIIVQLNESVAINCTRPNNNT
RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMVGFPVTPQVP
LRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLV
PVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC

[SEQ ID NO: 62]

Figure 10




Sequence of insert: 

tgggtgcccgagcttcggtactgtctggtggagagctggacagatgggagaaaattaggctgcgcccggga 
ggcaaaaagaaatacaagctcaagcatatcgtgtgggcctcgagggagcttgaacggtttgcggtgaaccc 
aggcctgctggaaacatctgagggatgtcgccagatcctggggcaattgcagccatccctccagaccggga 
gtgaagagctgaggtccttgtataacacagtggctaccctctactgcgtacaccagaggatcgagattaag 
gataccaaggaggccttggacaaaattgaggaggagcaaaacaagagcaagaagaaggcccagcaggcagc 
tgctgacactgggcatagcaaccaggtatcacagaactatcctattgtccaaaacattcagggccagatgg 
ttcatcaggccatcagcccccggacgctcaatgcctgggtgaaggttgtcgaagagaaggccttttctcct 
gaggttatccccatgttctccgctttgagtgagggggccactcctcaggacctcaatacaatgcttaatac 
cgtgggcggccatcaggccgccatgcaaatgttgaaggagactatcaacgaggaggcagccgagtgggaca 
gagtgcatcccgtccacgctggcccaatcgcgcccggacagatgcgggagcctcgcggctctgacattgcc 
ggcaccacctctacactgcaagagcaaatcggatggatgaccaacaatcctcccatcccagttggagaaat 
ctataaacggtggatcattctcggtctcaataaaattgttagaatgtactctccgacatccatccttgaca 
ttagacagggacccaaagagccttttagggattacgtcgaccggttttataagaccctgcgagcagagcag
gcctctcaggaggtcaaaaactggatgacggagacactcctggtacagaacgctaaccccgactgcaaaac 
aatcttgaaggcactaggcccggctgccaccctggaagagatgatgaccgcctgtcagggagtaggcggac 
ccggacacaaagccagagtgttg*atggtgggttttccagtcacacctcaggtacctttaagaccaatgac 
ttacaaggcagctgtagatcttagccactttttaaaagaaaaggggggactggaagggctaattcactccc 
aaagaagacaagatatccttgatctgtggatctaccacacacaaggctacttccctgattggcagaactac 
acaccagggccaggggtcagatatccactgacctttggatggtgctacaagctagtaccagttgagccaga 
taaggtagaagaggccaataaaggagagaacaccagcttgttacaccctgtgagcctgcatgggatggatg

acccggagagagaagtgttagagtggaggtttgacagccgcctagcatttcatcacgtggcccgagagctg 
catccggagtacttcaagaactgctga [SEQ ID NO: 63]
Plasmid pRix33:



BamHI




Sequence of insert: 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT
CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA 
CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG 
GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 
TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 
CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 
CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA 
CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 

ccttcgaaccaatcccgatccattactgtgcccctgccggattcgcgatcctcaagtgtaacaacaagacc 
ttcgacgggaagggcctgtgcaccaacgtcagcacggtgcagtgcacccatggcatccgccccgtcgtgag 
cacccagctgctgctgaacgggtccctggctgaggaggaggtggtgatccggtcggacaacttcatggaca
acaccaagacaatcatcgtccagctgaacgagtctgtggcgattaactgtacccggcctaacaacaacacc 
cgtaagggcatccacatcgggcctggacgggccttctatgccgcccgcaagatcatcggcgacatccggca 
ggcccattgcaacctctcccgcgcccagtggaataacaccctgaagcagatcgtgatcaagctgagagagc 

ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 
AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 
CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG 
AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 
CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 
CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 
TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGA 
GAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGT 
GTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGTCGCC 
AGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTG 
GCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGA 
GGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCAC
AGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTCAAT 
GCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTTCTCCGCTTTGAGTGA
GGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATGCAAATGT 
TGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCG 
CCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGG 
ATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCATTCTCGGTCTCAATA
AAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCTTTTAGGGAT 
TACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGA 
GACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACCC 
TGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGATGGTGGGT 
TTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTT 
AAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCT 
ACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACC
TTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACAC 
CAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTG 
ACAGCCGCCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCTAA 
[SEQ ID NO: 64] 

Amino acid sequence of antigen: 

MAEQLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLGNVTEYFNMWKNNM
VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT
FDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFMDNTKTIIVQLNESVAINCTRPNNNT
RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMGARASVLSGG
ELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTV
ATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAISPRTLN
AWVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAEWDRVHPVHAGPIA
PGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPKEPFRD
YVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVLMVG
FPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLT
FGWCYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC

[SEQ ID NO: 65] 
Plasmid pRix35:



[Plasmid map; BamHI site indicated]




Sequence of insert 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT

CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA 

CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG 

GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC

TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 

CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 

CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA 

CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 

CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 

TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG 

CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA 

ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC 

CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 

GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC 

ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 

AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 

CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG 

AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 

CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 

CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 

TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGA 

GAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGT 

GTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGTCGCC 

AGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTG 

GCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGA 

GGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCAC

AGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTCAAT 

GCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTTCTCCGCTTTGAGTGA 

GGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATGCAAATGT 

TGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCG 

CCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGG 
ATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCATTCTCGGTCTCAATA

AAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCTTTTAGGGAT

TACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGA

GACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACCC

TGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGATGGTGGGT

TTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTT

AAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCT

ACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACC

TTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACAC

CAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTG

ACAGCCGCCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAG

CCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTA

TTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCTATGGCAGGA

AGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCC
ACCTCCCAATCCAAAGGGGAGCCGACAGGCCCGAAGGAATAA [SEQ ID NO: 66]

Amino acid sequence of antigen: 
MAEQLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLGNVTEYFNMWKNNM
VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT
FDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFMDNTKTIIVQLNESVAINCTRPNNNT
RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMGARASVLSGG
ELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTV
ATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAISPRTLN
AWVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAEWDRVHPVHAGPIA
PGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPKEPFRD
YVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVLMVG
FPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLT 
FGWCYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSE 
Sequence of insert:

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT 

CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA

CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG

GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 

TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA

CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 

CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA

CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 

CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 

TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG 

CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA 

ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC 

CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 

GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC 

ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 

AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 

CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG 

AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG

CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 

CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 

TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGA 

GAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGT 

GTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGTCGCC 

AGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTG 

GCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGA 

GGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCAC 

AGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTCAAT 

GCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTTCTCCGCTTTGAGTGA 

GGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATGCAAATGT 

TGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCG 

CCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGG 

ATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCATTCTCGGTCTCAATA 

AAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCTTTTAGGGAT 
TACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGA

GACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACCC

TGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGATGGGTGGC 

AAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCAGCAGC 

AGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACAAGTAGCAATACAGCAGCTA 

CCAATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTA 

CCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGA 

AGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCC

CTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTA 

GTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAG

CCTGCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATC 

ACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTAGACTAGAG

CCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCA 

TTGCCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAA 

GACCTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCAAAGGGGAG 

CCGACAGGCCCGAAGGAATAA [SEQ ID NO: 68] 

Amino acid sequence of antigen: 
MAEQLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLGNVTEYFNMWKNNM
VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT
FDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFMDNTKTIIVQLNESVAINCTRPNNNT
RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMGARASVLSGG
ELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTV
ATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAISPRTLN
AWVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAEWDRVHPVHAGPIA
PGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPKEPFRD
YVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVLMGG

KWSKSSVVGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQEEEEVGFPVTPQV 
PLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGWCYKL 
VPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLE
PWKHPGSQPKTACTNCYCKKCCFHCQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGE 

PTGPKE [SEQ ID NO: 69] 
pRix40




DNA sequence of insert
ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT 

CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA 

CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG 

GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 

TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 

CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG

CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA 

CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 

CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 

TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG 

CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA

ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC 

CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 

GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC 

ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 

AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 

CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG 

AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 

CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 

CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 

TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGA 

GAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGT 

GTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGTCGCC 

AGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTG
GCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGA 
GGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCAC 
AGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTCAAT 
GCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTTCTCCGCTTTGAGTGA 
GGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATGCAAATGT 
TGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCG 
CCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGG

ATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCATTCTCGGTCTCAATA 

AAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCTTTTAGGGAT

TACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGA 

GACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACCC 

TGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGATGGGCAAG 

TGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCAGCAGCAGA 

TGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACAAGTAGCAATACAGCAGCTACCA 

ATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCT 

TTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGG 

GCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTG 

ATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTA 

CCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCT 

GCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACG

TGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTAGACTAGAGCCC

TGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTG 

CCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGAC 

CTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCAAAGGGGAGCCG 

ACAGGCCCGAAGGAATAA [SEQ ID NO: 70]

Amino acid sequence of insert
MAEQLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLGNVTEYFNMWKNNM
VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT
FDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFMDNTKTIIVQLNESVAINCTRPNNNT
RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMGARASVLSGG
ELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTV
ATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAISPRTLN
AWVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAEWDRVHPVHAGPIA
PGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPKEPFRD
YVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVLMGK
WSKSSVVGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQEEEEVGFPVTPQVP
LRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLV

PVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEP 
WKHPGSQPKTACTNCYCKKCCFHCQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGEP 

TGPKE [SEQ ID NO: 71] 
pRix41




DNA sequence of insert 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT 

CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA 

CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG 

GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 

TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 

CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 

CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA

CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 

CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 

TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG

CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA 

ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC 

CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 

GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC

ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 

AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 

CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG

AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 

CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 

CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 

TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGA 

GAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGT 

GTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGTCGCC 

AGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTG 

GCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGA

GGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCAC 

AGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTCAAT 

GCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTTCTCCGCTTTGAGTGA 

GGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATGCAAATGT 
TGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCG
CCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGG
ATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCATTCTCGGTCTCAATA 

AAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCTTTTAGGGAT
TACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGA 
GACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACCC 
TGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGATGGGTGGC 
AAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCAGCAGC 
AGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACAAGTAGCAATACAGCAGCTA 
CCAATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTA
CCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGA 
AGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCC 
CTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTA 
GTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCGCCTTACACCCTGTGAG
CCTGCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATC
ACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTAGACTAGAG
CCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCA 
TTGCCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAA 
GACCTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCAAAGGGGAG 

CCGACAGGCCCGAAGGAATAA [SEQ ID NO: 72]

Amino acid sequence of insert
MAEQLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLGNVTEYFNMWKNNM
VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT
FDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFMDNTKTIIVQLNESVAINCTRPNNNT
RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMGARASVLSGG
ELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTV
ATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAISPRTLN
AWVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAEWDRVHPVHAGPIA
PGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPKEPFRD
YVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVLMGG
KWSKSSVVGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQEEEEVGFPVTPQV
PLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGWCYKL 
VPVEPDKVEEANKGENTSALHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLE 

PWKHPGSQPKTACTNCYCKKCCFHCQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGE

PTGPKE [SEQ ID NO: 73] 
Figure 16

pRix42 




DNA sequence of insert 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT 
CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA

CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG
GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 
TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 
CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 

CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA
CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 
CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 
TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG 
CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA 

ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC

CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 

GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC 

ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 

AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 

CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG 

AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 

CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 

CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 

TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGA 

GAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGT 

GTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGTCGCC 

AGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTG 

GCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGA 

GGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCAC 

AGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTCAAT 

GCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTTCTCCGCTTTGAGTGA 

GGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATGCAAATGT 
TGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCG
CCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGG 
ATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCATTCTCGGTCTCAATA
AAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCTTTTAGGGAT
TACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGA 
GACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACCC 
TGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGATGGGTGGC 
AAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCAGCAGC 
AGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACAAGTAGCAATACAGCAGCTA 
CCAATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTA 
CCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGA 
AGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCC 
CTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTA 
GTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGGCACACCCTGTGAG
CCTGCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATC
ACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTAGACTAGAG 
CCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCA 
TTGCCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAA 
GACCTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCAAAGGGGAG 

CCGACAGGCCCGAAGGAATAA [SEQ ID NO: 74]

Amino acid sequence of insert
MAEQLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLGNVTEYFNMWKNNM
VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT
FDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFMDNTKTIIVQLNESVAINCTRPNNNT
RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMGARASVLSGG
ELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTV
ATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAISPRTLN
AWVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAEWDRVHPVHAGPIA
PGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPKEPFRD
YVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVLMGG
KWSKSSVVGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQEEEEVGFPVTPQV
PLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGWCYKL 
VPVEPDKVEEANKGENTSLAHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLE 
PWKHPGSQPKTACTNCYCKKCCFHCQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGE 

PTGPKE [SEQ ID NO: 75] 
pRix43




DNA sequence of insert 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT 
CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA 
CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG 
GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 
TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 
CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 
CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA
CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 
CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 
TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG 
CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA 
ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC 
CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 
GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC 
ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 
AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 
CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG 
AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 
CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 
CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 
TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGA 
GAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGT 
GTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGTCGCC 
AGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTG 
GCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGA 
GGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCAC
AGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTCAAT 
GCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTTCTCCGCTTTGAGTGA 
GGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATGCAAATGT 
TGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCG

CCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGG 

ATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCATTCTCGGTCTCAATA 

AAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCTTTTAGGGAT 

TACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGA 

GACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACCC 

TGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGATGGGTGGC 

AAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCAGCAGC 

AGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACAAGTAGCAATACAGCAGCTA 

CCAATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTA 

CCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGA 

AGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCC 

CTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTA 

GTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCGCCGCACACCCTGTGAG 

CCTGCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATC 

ACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTAGACTAGAG 

CCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCA 

TTGCCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAA 

GACCTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCAAAGGGGAG 

CCGACAGGCCCGAAGGAATAA [SEQ ID NO: 76]

Arninoacid sequence of insert 
MAEQLWVTVYYGVPVWKEATTTLFCASDAKAYD 

VDQMHEDI I SLWDQSLKPCVKLTPLCVTLDCDDWTTNSTTTTSNGWTGE IRKGE IKNCSFNITTS IRDKV 
OKEYALFYWLDWPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT 
FDGKGLCTNVSTVQCTHGIRPWSTQLLLNGSLAEEEWIRSDNFMDNTKTIIVQLNESVAINCTRPNNNT 
RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF 
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL 
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKWKVEPLGVAPTRAKRRWQRMGARASVLSGG 
ELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAWPGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTV 

ATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAISPRTLN 
AWVKWEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAEWDRVHPVHAGPIA 
PGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPKEPFRD 
YVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVLMGG 
KWSKSSVVGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQEEEEVGFPVTPQV 
PLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGWCYKL 
VPVEPDKVEEANKGENTSAAHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLE 
PWKHPGSQPKTACTNCYCKKCCFHCQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGE 

PTGPKE [SEQ ID NO: 77] 
Figure 18 

pRix44 




DNA sequence of insert 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT 
CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA 
CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG 
GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 
TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 
CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 
CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA 
CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 
CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 
TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG 
CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA 
ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC 
CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 
GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC 
ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 
AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 
CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG 
AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 
CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 
CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 
TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGA 
GAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGT 
GTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGTCGCC 
AGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTG 
GCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGA 
GGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCAC 
AGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTCAAT 
GCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTTCTCCGCTTTGAGTGA 
GGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATGCAAATGT 
Figure 18 continued 

TGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCG 
CCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGG 
ATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCATTCTCGGTCTCAATA 
AAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCTTTTAGGGAT 
TACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGA 
GACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACCC 
TGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGATGGGCAAG 
TGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCAGCAGCAGA 
TGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACAAGTAGCAATACAGCAGCTACCA 
ATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCT 
TTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGG 
GCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTG 
ATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTA 

CCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCGCCGCACACCCTGTGAGCCT 
GCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACG 

TGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTAGACTAGAGCCC 
TGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTG 
CCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGAC 
CTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCAAAGGGGAGCCG 

ACAGGCCCGAAGGAATAA [SEQ ID NO: 78] 

Aminoacid sequence of insert 
MAEQLWVTVYYGVPVWKEATTT 

VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV 
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT 

FDGKGLCTNVSTVQCTHGIRPVVSTQLLL^ 

RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF 
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL 
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMGARASVLSGG 
ELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTV 

ATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAISPRTLN 

AWVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAE 

PGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGP 

YVDRFYKTLRAEQASQEVKNWMTETLLVQNANP^ 

WSKSSVVGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQEEEEVGFPVTPQVP 
LRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLV 
PVEPDKVEEANKGENTSAAHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEP 
WKHPGSQPKTACTNCYCKKCCFHCQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGEP 

TGPKE [SEQ ID NO: 79] 
Figure 19 

pRix46 




DNA sequence of insert 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT 
CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA 
CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG 
GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 
TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 
CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 
CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA 
CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 
CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 
TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG 
CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA 
ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC 
CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 
GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC 
ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 
AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 
CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG 
AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 
CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 
CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 
TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGA 
GAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGT 
GTGGGCCTCGAGGGAGCTTGAACGGTTTGCGGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGTCGCC 
AGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTG 
GCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGA 
GGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCAC 
AGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTCAAT 
GCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTTCTCCGCTTTGAGTGA 
GGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATGCAAATGT 
Figure 19 continued 

TGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCG 
CCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGG 
ATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCATTCTCGGTCTCAATA 
AAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCTTTTAGGGAT 
TACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGA 
GACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACCC 
TGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGATGGGCAAG 
TGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCAGCAGCAGA 
TGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACAAGTAGCAATACAGCAGCTACCA 
ATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCT 
TTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGG 
GCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTG 
ATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTA 
CCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCGCCTTACACCCTGTGAGCCT 
GCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACG 
TGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTAGACTAGAGCCC 
TGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTG 
CCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGAC 
CTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCAAAGGGGAGCCG 
ACAGGCCCGAAGGAATAA [SEQ ID NO : 80] 

Aminoacid sequence of insert 

MAEQLWVTVYYGVPVWKEATTTLF 

VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV 
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT 
FDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSL^ 

RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF 
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLP^^ 

LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMGARASVLSGG 
ELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTV 
ATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAISPRTLN 
AWVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAEWDRVHPVHA 
PGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPKEPFRD 
YVDRFYKTLRAEQASQEVKNWMTETLLVQNANP 

WSKSSVVGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQEEEEVGFPVTPQVP 

LRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLV 

PVEPDKVEEANKGENTSALHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEP 

WKHPGSQPKTACTNCYCKKCCFHCQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGEP 
TGPKE [SEQ ID NO: 81] 
Figure 20 

pRix47 




DNA sequence of insert 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT 

CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA 

CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG 

GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 

TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 

CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 

CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA 

CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 

CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 

TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG 

CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA 

ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC 

CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 

GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC 

ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 

AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 

CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG 

AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 

CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 

CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 

TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGA 

GAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGT 

GTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGTCGCC 

AGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTG 

GCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGA 
GGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCAC 

AGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTCAAT 
GCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTTCTCCGCTTTGAGTGA 

GGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATGCAAATGT 
Figure 20 continued 

TGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCG 
CCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGG 
ATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCATTCTCGGTCTCAATA 
AAATTGTTAGAATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCTTTTAGGGAT 
TACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGA 
GACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACCC 

TGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGATGGGCAAG 
TGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCAGCAGCAGA 
TGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACAAGTAGCAATACAGCAGCTACCA 
ATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCT 
TTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGG 
GCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTG 
ATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTA 
CCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGGCACACCCTGTGAGCCT 
GCATGGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACG 
TGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTAGACTAGAGCCC 
TGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTG 
CCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGAC 
CTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCAAAGGGGAGCCG 

ACAGGCCCGAAGGAATAA [SEQ ID NO: 82] 

Aminoacid sequence of insert 
MAEQLWVTVYYGVPVWKEATTTLFCAS 

VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV 
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT 
FDGKGLCTNVSTVQCTHGIRPVVSTQL^^ 

RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF 
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL 
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMGARASVLSGG 
ELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTV 

ATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAISPRTLN 
AWVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTV^ 

PGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPKEPFRD 

YVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGH 

WSKSSVVGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQEEEEVGFPVTPQVP 

LRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLV 

PVEPDKVEEANKGENTSLAHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEP 

WKHPGSQPKTACTNCYCKKCCFHCQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGEP 

TGPKE [SEQ ID NO: 83] 
Figure 21 

pRix58 




DNA sequence of insert 

ATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCACCCTCTT 
CTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCGTGCCTA 
CGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAATAACATG 
GTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGACGCC 
TCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACGGCTGGA 
CCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGACAAGGTG 
CAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCACCAAGAA 
CAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCAAGGTGT 
CCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAACAAGACC 
TTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGTCGTGAG 
CACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCATGGACA 
ACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAACAACACC 
CGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACATCCGGCA 
GGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGAGAGAGC 
ACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCACTCCTTC 
AACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCACCGAGGG 
CAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGTGGCAGG 
AGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTG 
CTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCCCGGCGG 
CGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGCTCGGCG 
TGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGAATGGGCCCCATCAGTCCCATCGAGACCGTGCCG 
GTGAAGCTGAAACCCGGGATGGACGGCCCCAAGGTCAAGCAGTGGCCACTCACCGAGGAGAAGATCAAGGC 
CCTGGTGGAGATCTGCACCGAGATGGAGAAAGAGGGCAAGATCAGCAAGATCGGGCCTGAGAACCCATACA 
ACACCCCCGTGTTTGCCATCAAGAAGAAGGACAGCACCAAGTGGCGCAAGCTGGTGGATTTCCGGGAGCTG 
AATAAGCGGACCCAGGATTTCTGGGAGGTCCAGCTGGGCATCCCCCATCCGGCCGGCCTGAAGAAGAAGAA 
GAGCGTGACCGTGCTGGACGTGGGCGACGCTTACTTCAGCGTCCCTCTGGACGAGGACTTTAGAAAGTACA 
CCGCCTTTACCATCCCATCTATCAACAACGAGACCCCTGGCATCAGATATCAGTACAACGTCCTCCCCCAG 
GGCTGGAAGGGCTCTCCCGCCATTTTCCAGAGCTCCATGACCAAGATCCTGGAGCCGTTTCGGAAGCAGAA 
CCCCGATATCGTCATCTACCAGTACATGGACGACCTGTACGTGGGCTCTGACCTGGAAATCGGGCAGCATC 
GCACGAAGATTGAGGAGCTGAGGCAGCATCTGCTGAGATGGGGCCTGACCACTCCGGACAAGAAGCATCAG 
AAGGAGCCGCCATTCCTGAAGATGGGCTACGAGCTCCATCCCGACAAGTGGACCGTGCAGCCTATCGTCCT 
CCCCGAGAAGGACAGCTGGACCGTGAACGACATCCAGAAGCTGGTGGGCAAGCTCAACTGGGCTAGCCAGA 
TCTATCCCGGGATCAAGGTGCGCCAGCTCTGCAAGCTGCTGCGCGGCACCAAGGCCCTGACCGAGGTGATT 
CCCCTCACGGAGGAAGCCGAGCTCGAGCTGGCTGAGAACCGGGAGATCCTGAAGGAGCCCGTGCACGGCGT 
GTACTATGACCCCTCCAAGGACCTGATCGCCGAAATCCAGAAGCAGGGCCAGGGGCAGTGGACATACCAGA 
TTTACCAGGAGCCTTTCAAGAACCTCAAGACCGGCAAGTACGCCCGCATGAGGGGCGCCCACACCAACGAT 
GTCAAGCAGCTGACCGAGGCCGTCCAGAAGATCACGACCGAGTCCATCGTGATCTGGGGGAAGACACCCAA 
GTTCAAGCTGCCTATCCAGAAGGAGACCTGGGAGACGTGGTGGACCGAATATTGGCAGGCCACCTGGATTC 
CCGAGTGGGAGTTCGTGAATACACCTCCTCTGGTGAAGCTGTGGTACCAGCTCGAGAAGGAGCCCATCGTG 
GGCGCGGAGACATTCTACGTGGACGGCGCGGCCAACCGCGAAACAAAGCTCGGGAAGGCCGGGTACGTCAC 
CAACCGGGGCCGCCAGAAGGTCGTCACCCTGACCGACACCACCAACCAGAAGACGGAGCTGCAGGCCATCT 
ATCTCGCTCTCCAGGACTCCGGCCTGGAGGTGAACATCGTGACGGACAGCCAGTACGCGCTGGGCATTATT 
CAGGCCCAGCCGGACCAGTCCGAGAGCGAACTGGTGAACCAGATTATCGAGCAGCTGATCAAGAAAGAGAA 
GGTCTACCTCGCCTGGGTCCCGGCCCATAAGGGCATTGGCGGCAACGAGCAGGTCGACAAGCTGGTGAGTG 
CGGGGATTAGAAAGGTGCTGATGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTAC 
AAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAAAG 
AAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACAC 
CAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAG 
GTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGGATGGATGACCC 
GGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGAGAGCTGCATC 
CGGAGTACTTCAAGAACTGCATGGGTGCCCGAGCTTCGGTACTGTCTGGTGGAGAGCTGGACAGATGGGAG 
AAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAAGCTCAAGCATATCGTGTGGGCCTCGAGGGAGCT 
TGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACATCTGAGGGATGTCGCCAGATCCTGGGGCAATTGC 
AGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCCTTGTATAACACAGTGGCTACCCTCTACTGCGTA 
CACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTTGGACAAAATTGAGGAGGAGCAAAACAAGAGCAA 
GAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATAGCAACCAGGTATCACAGAACTATCCTATTGTCC 
AAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGCCCCCGGACGCTCAATGCCTGGGTGAAGGTTGTC 
GAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTTCTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGA 
CCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGGCCGCCATGCAAATGTTGAAGGAGACTATCAACG 
AGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCACGCTGGCCCAATCGCGCCCGGACAGATGCGGGAG 
CCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACTGCAAGAGCAAATCGGATGGATGACCAACAATCC 
TCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCATCCTGGGCCTGAACAAGATCGTGCGCATGTACT 
CTCCGACATCCATCCTTGACATTAGACAGGGACCCAAAGAGCCTTTTAGGGATTACGTCGACCGGTTTTAT 
AAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAAAAACTGGATGACGGAGACACTCCTGGTACAGAA 
CGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAGGCCCGGCTGCCACCCTGGAAGAGATGATGACCG 
CCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGAGTGTTGTAA [SEQ ID NO: 84] 

Aminoacid sequence of insert 

MAEQLWVTVYYGVPVWKEATTTLFCAS 

VDQMHEDIISLWDQSLKPCVKLTPLCVTLDCDDVNTTNSTTTTSNGWTGEIRKGEIKNCSFNITTSIRDKV 
QKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNNKT 
FDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFMDNTKTIIVQLNESVAINCTRPNNNT 
RKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRHSF 
NCGGEFFYCDTTQLFNSTWNGTEGNNTEGNSTITLPCRIKQIINMWQEVGKAMYAPPIGGQIRCSSNITGL 
LLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQRMGPISPIETVP 
VKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNTPVFAIKKKDSTKWRKLVDFREL 

NKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDEDFRKYTAFTIPSINNETPGIRYQYNVLPQ 
GWKGSPAIFQSSMTKILEPFRKQNPDIVIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLRWGLTTPDKKHQ 
KEPPFLKMGYELHPDKWTVQPIVLPEKDSWTV^ 

PLTEEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHTND 
VKQLTEAVQKITTESIVIWGKTPKFKLPIQKETWETWWTEYWQATWIPEWEFVNTPPLVKLWYQLEKEPIV 
GAETFYVDGAANRETKLGKAGYVTNRGRQKVVTLTDTTNQKTELQAIYLALQDSGLEVNIVTDSQYALGII 
QAQPDQSESELVNQIIEQLIKKEKVYLAWVPAHKGIGGNEQVDKLVSAGIRKVLMVGFPVTPQVPLRPMTY 

KAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDK 

VEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCMGARASVLSGGELDRWE 

KIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTVATLYCV 

HQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAISPRTLNAWVKVV 

EEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAEWDRVHPVHAGPIAPGQM 

PRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPKEPFRDYVDRFY 

KTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVL [SEQ ID NO: 85] 






WO 2004/041852 



PCT/EP2003/012402 



Figure 22 

pRix59 




DNA sequence of insert 

ATGGGCCCCATCAGTCCCATCGAGACCGTGCCGGTGAAGCTGAAACCCGGGATGGACGGCCCCAAGGTCAA 

GCAGTGGCCACTCACCGAGGAGAAGATCAAGGCCCTGGTGGAGATCTGCACCGAGATGGAGAAAGAGGGCA 

AGATCAGCAAGATCGGGCCGGAGAACCCATACAACACCCCCGTGTTTGCCATCAAGAAGAAGGACAGCACC 

AAGTGGCGCAAGCTGGTGGATTTCCGGGAGCTGAATAAGCGGACCCAGGATTTCTGGGAGGTCCAGCTGGG 

CATCCCCCATCCGGCCGGCCTGAAGAAGAAGAAGAGCGTGACCGTGCTGGACGTGGGCGACGCTTACTTCA 

GCGTCCCTCTGGACGAGGACTTTAGAAAGTACACCGCCTTTACCATCCCATCTATCAACAACGAGACCCCT 

GGCATCAGATATCAGTACAACGTCCTCCCCCAGGGCTGGAAGGGCTCTCCCGCCATTTTCCAGAGCTCCAT 

GACCAAGATCCTGGAGCCGTTTCGGAAGCAGAACCCCGATATCGTCATCTACCAGTACATGGACGACCTGT 

ACGTGGGCTCTGACCTGGAAATCGGGCAGCATCGCACGAAGATTGAGGAGCTGAGGCAGCATCTGCTGAGA 

TGGGGCCTGACCACTCCGGACAAGAAGCATCAGAAGGAGCCGCCATTCCTGAAGATGGGCTACGAGCTCCA 

TCCCGACAAGTGGACCGTGCAGCCTATCGTCCTCCCCGAGAAGGACAGCTGGACCGTGAACGACATCCAGA 

AGCTGGTGGGCAAGCTCAACTGGGCTAGCCAGATCTATCCCGGGATCAAGGTGCGCCAGCTCTGCAAGCTG 

CTGCGCGGCACCAAGGCCCTGACCGAGGTGATTCCCCTCACGGAGGAAGCCGAGCTCGAGCTGGCTGAGAA 

CCGGGAGATCCTGAAGGAGCCCGTGCACGGCGTGTACTATGACCCCTCCAAGGACCTGATCGCCGAAATCC 

AGAAGCAGGGCCAGGGGCAGTGGACATACCAGATTTACCAGGAGCCTTTCAAGAACCTCAAGACCGGCAAG 

TACGCCCGCATGAGGGGCGCCCACACCAACGATGTCAAGCAGCTGACCGAGGCCGTCCAGAAGATCACGAC 

CGAGTCCATCGTGATCTGGGGGAAGACACCCAAGTTCAAGCTGCCTATCCAGAAGGAGACCTGGGAGACGT 

GGTGGACCGAATATTGGCAGGCCACCTGGATTCCCGAGTGGGAGTTCGTGAATACACCTCCTCTGGTGAAG 

CTGTGGTACCAGCTCGAGAAGGAGCCCATCGTGGGCGCGGAGACATTCTACGTGGACGGCGCGGCCAACCG 

CGAAACAAAGCTCGGGAAGGCCGGGTACGTCACCAACCGGGGCCGCCAGAAGGTCGTCACCCTGACCGACA 

CCACCAACCAGAAGACGGAGCTGCAGGCCATCTATCTCGCTCTCCAGGACTCCGGCCTGGAGGTGAACATC 

GTGACGGACAGCCAGTACGCGCTGGGCATTATTCAGGCCCAGCCGGACCAGTCCGAGAGCGAACTGGTGAA 

CCAGATTATCGAGCAGCTGATCAAGAAAGAGAAGGTCTACCTCGCCTGGGTCCCGGCCCATAAGGGCATTG 

GCGGCAACGAGCAGGTCGACAAGCTGGTGAGTGCGGGGATTAGAAAGGTGCTGATGGTGGGTTTTCCAGTC 

GGGGGGACTGGAAGGGCTAATTCACTCCCAAAGAAGACAAGATATCCTTGATCTGTGGATCTACCACACAC 

AAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGG 
^TACAAGCTAGra^^ 

ACACCCTGTGAGCCTGCATGGGATGGATGACCCGGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCC 
TAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCATGGGTGCCCGAGCTTCG 
GTACTGTCTGGTGGAGAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAA 
GCTCAAGCATATCGTGTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACAT 
CTGAGGGATGTCGCCAGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCC 
TTGTATAACACAGTGGCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTT 
GGACAAAATTGAGGAGGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATA 
GCAACCAGGTATCACAGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGC 
CCCCGGACGCTCAATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTT 
CTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGG 
CCGCCATGCAAATGTTGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCAC 
GCTGGCCCAATCGCGCCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACT 
GCAAGAGCAAATCGGATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCA 
TCCTGGGCCTGAACAAGATCGTGCGCATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAA 
GAGCCTTTTAGGGATTACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAA 
AAACTGGATGACGGAGACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAG 
GCCCGGCTGCCACCCTGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGA 
GTGTTGATGGCCGAGCAGCTGTGGGTCACCGTCTACTACGGCGTGCCTGTGTGGAAGGAGGCCACGACCAC 
CCTCTTCTGCGCGAGCGACGCCAAGGCCTACGACACGGAAGTGCATAACGTGTGGGCGACGCATGCTTGCG 
TGCCTACGGACCCCAACCCCCAGGAGGTGGTGCTGGGAAACGTGACCGAGTACTTCAACATGTGGAAGAAT 
AACATGGTGGATCAGATGCACGAGGACATCATCTCTCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCT 
GACGCCTCTCTGCGTGACACTGGACTGTGACGACGTCAACACCACCAACAGCACTACCACCACCAGCAACG 
GCTGGACCGGAGAGATTCGGAAGGGCGAGATCAAGAACTGCTCCTTCAATATCACGACCTCGATCAGAGAC 
AAGGTGCAGAAGGAATACGCGCTGTTTTATAATCTCGATGTGGTCCCCATCGACGACGACAATGCCACCAC 
CAAGAACAAGACGACGCGTAATTTCAGACTCATTCACTGCAACAGCAGCGTCATGACGCAGGCCTGCCCCA 
AGGTGTCCTTCGAACCAATCCCGATCCATTACTGTGCCCCTGCCGGATTCGCGATCCTCAAGTGTAACAAC 
AAGACCTTCGACGGGAAGGGCCTGTGCACCAACGTCAGCACGGTGCAGTGCACCCATGGCATCCGCCCCGT 
CGTGAGCACCCAGCTGCTGCTGAACGGGTCCCTGGCTGAGGAGGAGGTGGTGATCCGGTCGGACAACTTCA 
TGGACAACACCAAGACAATCATCGTCCAGCTGAACGAGTCTGTGGCGATTAACTGTACCCGGCCTAACAAC 
AACACCCGTAAGGGCATCCACATCGGGCCTGGACGGGCCTTCTATGCCGCCCGCAAGATCATCGGCGACAT 
CCGGCAGGCCCATTGCAACCTCTCCCGCGCCCAGTGGAATAACACCCTGAAGCAGATCGTGATCAAGCTGA 
GAGAGCACTTTGGAAACAAGACCATCAAGTTCAATCAGAGTTCTGGCGGAGACCCCGAGATCGTGCGGCAC 
TCCTTCAACTGCGGGGGCGAGTTCTTCTACTGCGATACGACACAGCTCTTCAACTCCACCTGGAACGGCAC 
CGAGGGCAACAACACAGAGGGAAACTCCACTATCACCCTCCCTTGCCGCATCAAGCAGATCATCAACATGT 
GGCAGGAGGTGGGAAAGGCCATGTATGCCCCCCCCATCGGGGGCCAGATCCGCTGCTCCTCCAACATCACC 
GGCCTGCTGCTCACCAGAGACGGGGGCACCGAGGGCAACGGCACGGAGAACGAGACGGAGATCTTCAGGCC 
CGGCGGCGGCGACATGAGGGATAACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTGAAGGTGGAGCCGC 
TCGGCGTGGCCCCCACCCGGGCCAAGCGCCGCGTCGTGCAGAGATGA [SEQ ID NO: 86] 

Aminoacid sequence of insert 

MGPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNTPVFAIKKKDST 
KWRKLVD FRELNKR^ 

GIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDIVIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLR 
WGLTTPDKKHQKEPPFLKMGYELHPDKWTVQPIVLP 

LRGTKALTEVIPLTEEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGK 
YARMRGAHTNDVKQLTEAVQKITTESIVIWGKTPK^ 
LWYQLEKEPIVGAETFYVDGAANRETKLGKAGYV^ 
VTDSQYALGIIQAQPDQSESELWQIIEQLIKKEK^ 

TPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGW 
CYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCMGARAS 
VLSGGELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSEELRS 
LYNTVATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAIS 
PRTLNAWVKWEEKAFSPEVIPMFSALSEGATPQDLNTML^ 

AGPIAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPK 
EPFRDYVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGG 
VLMAEQLWVTVYYGVPVWKEAT 
NMVDQMHEDIISLWDQSLKPCVKL 

KVQKEYALFYNLDVVPIDDDNATTKNKTTRNFRLIHCNSSVMTQACPKVSFEPIPIHYCAPAGFAILKCNN 
KTFDGKGLCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFMDNTKTIIVQLNESVAINCTRPNN 

NTRKGIHIGPGRAFYAARKIIGDIRQAHCNLSRAQWNNTLKQIVIKLREHFGNKTIKFNQSSGGDPEIVRH 
SFNCGGEFFYCDTTQLFNSTW^^ 

GLLLTRDGGTEGNGTENETEIFRPGGGDMRDNWRSELYKYKVVKVEPLGVAPTRAKRRVVQR [SEQ ID NO: 87] 
Figure 23 

pRix50 




DNA and amino acid sequences of inserts: 
Identical to pNTm and pRix12 
Figure 24 

pRix51 




DNA and amino acid sequences of inserts: 
Identical to p73I-GN2 and pRix12 






Figure 25 

pRix53 




DNA and amino acid sequences of inserts: 
Identical to pRix52 and pRix12 
Figure 26 

pRix54 




DNA and amino acid sequences of inserts: 
Identical to pT-RNG and pRix12 
Figure 27 

pRix60 




DNA and amino acid sequences of inserts: 
Identical to pRix12 and pT-RNG 
Figure 28 

pT-RNG 




DNA sequence of insert: 

ATGGGCCCCATCAGTCCCATCGAGACCGTGCCGGTGAAGCTGAAACCCGGGATGGACGGCCCCAAGGTCAA 

GCAGTGGCCACTCACCGAGGAGAAGATCAAGGCCCTGGTGGAGATCTGCACCGAGATGGAGAAAGAGGGCA 

AGATCAGCAAGATCGGGCCTGAGAACCCATACAACACCCCCGTGTTTGCCATCAAGAAGAAGGACAGCACC 

AAGTGGCGCAAGCTGGTGGATTTCCGGGAGCTGAATAAGCGGACCCAGGATTTCTGGGAGGTCCAGCTGGG 

CATCCCCCATCCGGCCGGCCTGAAGAAGAAGAAGAGCGTGACCGTGCTGGACGTGGGCGACGCTTACTTCA 

GCGTCCCTCTGGACGAGGACTTTAGAAAGTACACCGCCTTTACCATCCCATCTATCAACAACGAGACCCCT 

GGCATCAGATATCAGTACAACGTCCTCCCCCAGGGCTGGAAGGGCTCTCCCGCCATTTTCCAGAGCTCCAT 

GACCAAGATCCTGGAGCCGTTTCGGAAGCAGAACCCCGATATCGTCATCTACCAGTACATGGACGACCTGT 

ACGTGGGCTCTGACCTGGAAATCGGGCAGCATCGCACGAAGATTGAGGAGCTGAGGCAGCATCTGCTGAGA 

TGGGGCCTGACCACTCCGGACAAGAAGCATCAGAAGGAGCCGCCATTCCTGAAGATGGGCTACGAGCTCCA 

TCCCGACAAGTGGACCGTGCAGCCTATCGTCCTCCCCGAGAAGGACAGCTGGACCGTGAACGACATCCAGA 

AGCTGGTGGGCAAGCTCAACTGGGCTAGCCAGATCTATCCCGGGATCAAGGTGCGCCAGCTCTGCAAGCTG 

CTGCGCGGCACCAAGGCCCTGACCGAGGTGATTCCCCTCACGGAGGAAGCCGAGCTCGAGCTGGCTGAGAA 

CCGGGAGATCCTGAAGGAGCCCGTGCACGGCGTGTACTATGACCCCTCCAAGGACCTGATCGCCGAAATCC 

AGAAGCAGGGCCAGGGGCAGTGGACATACCAGATTTACCAGGAGCCTTTCAAGAACCTCAAGACCGGCAAG 

TACGCCCGCATGAGGGGCGCCCACACCAACGATGTCAAGCAGCTGACCGAGGCCGTCCAGAAGATCACGAC 

CGAGTCCATCGTGATCTGGGGGAAGACACCCAAGTTCAAGCTGCCTATCCAGAAGGAGACCTGGGAGACGT 

GGTGGACCGAATATTGGCAGGCCACCTGGATTCCCGAGTGGGAGTTCGTGAATACACCTCCTCTGGTGAAG 

CTGTGGTACCAGCTCGAGAAGGAGCCCATCGTGGGCGCGGAGACATTCTACGTGGACGGCGCGGCCAACCG 

CGAAACAAAGCTCGGGAAGGCCGGGTACGTCACCAACCGGGGCCGCCAGAAGGTCGTCACCCTGACCGACA 

CCACCAACCAGAAGACGGAGCTGCAGGCCATCTATCTCGCTCTCCAGGACTCCGGCCTGGAGGTGAACATC 

GTGACGGACAGCCAGTACGCGCTGGGCATTATTCAGGCCCAGCCGGACCAGTCCGAGAGCGAACTGGTGAA 

CCAGATTATCGAGCAGCTGATCAAGAAAGAGAAGGTCTACCTCGCCTGGGTCCCGGCCCATAAGGGCATTG 

GCGGCAACGAGCAGGTCGACAAGCTGGTGAGTGCGGGGATTAGAAAGGTGCTGATGGTGGGTTTTCCAGTC 

ACACCTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAA 

GGGGGGACTGGAAGGGCTAATTCACTCCCAAAGAAGACAAGATATCCTTGATCTGTGGATCTACCACACAC 

AAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTTGGATGG 

TGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTT 
ACACCCTGTGAGCCTGCATGGGATGGATGACCCGGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCC 
TAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCATGGGTGCCCGAGCTTCG 
GTACTGTCTGGTGGAGAGCTGGACAGATGGGAGAAAATTAGGCTGCGCCCGGGAGGCAAAAAGAAATACAA 
GCTCAAGCATATCGTGTGGGCCTCGAGGGAGCTTGAACGGTTTGCCGTGAACCCAGGCCTGCTGGAAACAT 
CTGAGGGATGTCGCCAGATCCTGGGGCAATTGCAGCCATCCCTCCAGACCGGGAGTGAAGAGCTGAGGTCC 
TTGTATAACACAGTGGCTACCCTCTACTGCGTACACCAGAGGATCGAGATTAAGGATACCAAGGAGGCCTT 
GGACAAAATTGAGGAGGAGCAAAACAAGAGCAAGAAGAAGGCCCAGCAGGCAGCTGCTGACACTGGGCATA 
GCAACCAGGTATCACAGAACTATCCTATTGTCCAAAACATTCAGGGCCAGATGGTTCATCAGGCCATCAGC 
CCCCGGACGCTCAATGCCTGGGTGAAGGTTGTCGAAGAGAAGGCCTTTTCTCCTGAGGTTATCCCCATGTT 
CTCCGCTTTGAGTGAGGGGGCCACTCCTCAGGACCTCAATACAATGCTTAATACCGTGGGCGGCCATCAGG 
CCGCCATGCAAATGTTGAAGGAGACTATCAACGAGGAGGCAGCCGAGTGGGACAGAGTGCATCCCGTCCAC 
GCTGGCCCAATCGCGCCCGGACAGATGCGGGAGCCTCGCGGCTCTGACATTGCCGGCACCACCTCTACACT 
GCAAGAGCAAATCGGATGGATGACCAACAATCCTCCCATCCCAGTTGGAGAAATCTATAAACGGTGGATCA 
TCCTGGGCCTGAACAAGATCGTGCGCATGTACTCTCCGACATCCATCCTTGACATTAGACAGGGACCCAAA 
GAGCCTTTTAGGGATTACGTCGACCGGTTTTATAAGACCCTGCGAGCAGAGCAGGCCTCTCAGGAGGTCAA 
AAACTGGATGACGGAGACACTCCTGGTACAGAACGCTAACCCCGACTGCAAAACAATCTTGAAGGCACTAG 

GCCCGGCTGCCACCCTGGAAGAGATGATGACCGCCTGTCAGGGAGTAGGCGGACCCGGACACAAAGCCAGA 
GTGTTGTAA [SEQ ID NO: 88]

Amino acid sequence of insert: 

MGPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNTPVFAIKKKDST
KWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSV^ 

GIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDIVIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLR 
WGLTTPDKKHQKEPPFLKMGYELHPDKWTVQPIVLPEKDSWTVNDIQKLVGKLNWASQIYPGIKVRQLCKL
LRGTKALTEVIPLTEEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGK
YARMRGAHTNDVKQLTEAVQKIT^ 

LWYQLEKEPIVGAETFYVDGAANRETKLGKAGYVTNRGRQKVV

VTDSQYALGIIQAQPDQSESELVNQIIEQLIKKEKVYLAWVPAHKGIGGNEQVDKLVSAGIRKVLMVGFPV
TPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGW 
CYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCMGARAS 

VLSGGELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSEELRS 
LYNTVATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNYPIVQNIQGQMVHQAIS
PRTLNAWVKVVEEKAFSPEVIPMFSALSEGATPQDL^ 

AGPIAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIRQGPK 

EPFRDYVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGH
VL [SEQ ID NO: 89]
Figure 29

A schematic representation of the constructs and associated expression data is shown below.
Figure 30

A schematic representation of further constructs and associated expression data: 
[Western blot images: one panel probed with anti-Nef antisera and one panel probed with anti-gp120 antisera; molecular weight marker positions 175, 83, 62, 47.5 and 32.5. Lane labels are not recoverable.]
Western blots probed with anti-Nef (left) or anti-gp120 (right) antisera, showing the expression of RNG and dsgp120 from dual promoter and single vectors.

Plasmid schematics:
Figure 31

Expression data (anti-Nef) for dsgp120/Gag/Nef/Tat fusions with mutations in Nef (pRix 40-47)

Expression data (anti-Nef and anti-gp120) for dual promoter vectors
Figure 32

Responses to gp120 peptide by IFN-gamma ELIspot at 7 days post-boost (Day 35)
Figure 33

Responses to gp120 peptide by IFN-gamma ELIspot at 7 days post-boost (Day 35) 
Figure 34